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THE INTEL Pentium™ PROCESSOR 
A TECHNICAL OVERVIEW 


Intel’s new Pentium processor combines the perform- 
ance traditionally associated with minicomputers and 
workstations with the flexibility and compatibility that 
characterize the personal computer platform. Designed 
to meet the needs of today’s and tomorrow’s sophisti- 
cated software applications, the Pentium processor ex- 
tends the range of Intel’s microprocessor architecture 
to new heights, blurring previous distinctions between 
hardware platforms and creating an entirely new realm 
of possibilities for desktop computers and servers. 


This paper begins by presenting an overview of the Pen- 
tium processor, after which it provides details on the 
key technological features that enable this new Intel 
solution to meet the market’s evolving requirements for 
high performance, continued software compatibility 
and advanced functionality. All features are explored in 
the context of how they benefit today’s high-end desk- 
top computer and server users, as well as those users 
who can now migrate from other platforms to take ad- 
vantage of the Pentium processor’s performance advan- 
tages. 


Intel Pentium 


November 1993 
Order Number: 241610-002 


INTEL’S NEXT GENERATION OF 
POWER 


Incorporating more than 3.1 million transistors on a 
single piece of silicon, the 32-bit Pentium processor fea- 
tures high-performance clock speeds of 60 MHz and 
66 MHz. Its superscalar architecture utilizes advanced 
design techniques that enable it to execute more than 
one instruction per clock cycle, resulting in the ability 
to run today’s huge base of PC-compatible software 
faster than any other microprocessor. Beyond this ex- 
isting software base, the Pentium processor’s high-per- 
formance floating-point unit provides the advanced 
computing power necessary to tackle demanding tech- 
nical and scientific applications previously reserved for 
the workstation platform. And as local-area and wide- 
area networks (LANs and WANs) continue to replace 
the mainframe-driven hierarchical networks of the past, 
the Pentium processor’s multiprocessing performance 
and operating system flexibility are ideal for the host of 
new client/server applications that are appearing across 
the industry. 


Processor 
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While the Pentium processor can achieve performance 
levels equal to or better than that of today’s high-end 
workstations, it enjoys an advantage that none of its 
workstation counterparts can claim: complete compati- 
bility with the more than 50,000 software applica- 
tions—worth billions of dollars—that have been writ- 
ten for the Intel architecture. In addition, the Pentium 
processor runs all of the advanced operating systems 
now available for today’s desktop PCs, workstations 
and servers, including UNIX*, Windows* and OS/2*. 
This provides users and systems manufacturers with 
the flexibility that today’s heterogenous networked dis- 
tributed computing environments require. 


THE PENTIUM PROCESSOR: 
TECHNICAL INNOVATIONS 


A number of innovative product features contribute to 
the Pentium processor’s unique combination of high 
performance, compatibility, data integrity and upgrad- 
ability. These include: 


¢ Superscalar architecture 

¢@ Separate code and data caches 

e Branch prediction 

¢ High-performance floating-point unit 

e Enhanced 64-bit data bus 

® Multiprocessing support 

© Memory page size option 

Error detection and functional redundancy 
Performance monitoring 


Intel OverDrive™ processor upgradability 


Superscalar Architecture 


The Pentium processor’s superscalar architecture is the 
only Intel-compatible dual-pipelined architecture in the 
industry, enabling the processor to achieve new levels 
of performance by executing more than one instruction 
per clock cycle. The term “superscalar” refers to a mi- 
croprocessor architecture that contains more than one 
execution unit. These execution units—or pipelines— 
are where the chip processes the data and instructions 
that are fed to it by the rest of the system. 


The Pentium processor’s superscalar implementation 
represents a natural progression from previous genera- 
tions of CPUs in the 32-bit Intel architecture. The In- 
tel486™ CPU, for example, was able to execute many 
of its instructions in one clock cycle, while previous 
generations of Intel microprocessors required multiple 
clock cycles to execute a single instruction. 
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This ability to execute multiple instructions per clock 
cycle is due to the fact that the Pentium processor’s two 
pipelines can execute two instructions simultaneously. 
As with the Intel486 CPU’s single pipeline, the Penti- 
um processor’s dual pipelines execute integer instruc- 
tions in five stages: prefetch, decode 1, decode 2, execute 
and writeback. This permits several instructions to be in 
various stages of execution, thus increasing processing 
performance. Each pipeline has its own arithmetic logic 
unit (ALU), address generation circuitry and data 
cache interface. 


As with the Intel486 CPU, the Pentium processor uses 
hardwired instructions to replace many of the micro- 
coded instructions used in previous microprocessor 
generations. These instructions include loads, stores, 
and simple ALU operations which can be executed by 
the processor’s hardware without requiring microcode. 
This improves performance without affecting compati- 
bility. In the case of more complex instructions, the 
Pentium processor’s enhanced microcode further 
boosts performance by employing both of the supersca- 
lar architecture’s dual integer pipelines to execute in- 
structions. 


The result of these architectural innovations is that, 
compared to previous microprocessor implementations, 
many more instructions can be executed in a specified 
amount of time (see the iCOMP™ ratings on page 6 for 
relative performance comparisons among Intel proces- 
sors). 


Separate Code and Data Caches 


Another significant advancement pioneered by the Pen- 
tium processor is its innovative cache implementation. 
Caches increase performance by acting as temporary 
storage places for commonly used code and data ob- 
tained from slower memory, replacing the need to go 
off-chip to the system’s main memory to fetch instruc- 
tions. The Intel486 CPU, for example, contains a single 
8-Kbyte on-chip cache to handle both code and data 
caching functions. Intel designers improved on this im- 
plementation by using the additional circuitry provided 
by the Pentium processor’s 3.1 million transistors 
(compared to the Intel486 CPU’s 1.2 million transis- 
tors) to create separate on-chip code and data caches. 
This improves performance by reducing bus conflicts 
and making the two caches available more often when 
they are needed. 


The Pentium processor’s code and data caches each 
contain 8 Kbytes of information, and both are orga- 
nized as two-way set associative caches—meaning that 
they save time by searching only pre-specified 32-byte 
segments, rather than the entire cache. All of these per- 
formance-enhancing features are in turn supplemented 
by the Pentium processor’s 64-bit internal data bus, 
which ensures that the dual caches and superscalar exe- 
cution pipelines are continually supplied with data. 
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In addition, the data cache uses two other important 
techniques: “‘writeback” caching and an algorithm 
called the MESI (Modified, Exclusive, Shared, Invalid) 
protocol. Writeback caches complete writes in the 
cache without going out to main memory as in previ- 
ous-generation “write-through” cache implementa- 
tions. This technique increases performance by reduc- 
ing bus utilization and preventing needless bottlenecks 
in the system. And the MESI protocol ensures that 
data in the cache and in main memory is consistent— 
an important consideration in advanced multiprocess- 
ing systems where different processors often need to 
operate on the same data. 


Branch Prediction 


Branch prediction is an advanced computing technique 
that boosts performance by keeping execution pipelines 
full, based on predetermining the most likely set of in- 
structions to be executed. The Intel Pentium processor 
is the first and only PC-compatible processor to use 
branch prediction, which until now has traditionally 
been associated with the mainframe computing plat- 
form. 


For a better understanding of this concept, consider a 
typical application program. After each pass through a 
software loop. the program performs a conditional test 
to determine whether to return to the beginning of the 
loop or to exit and continue on to the next execution 
step. These two choices, or paths, are called branches. 
Branch prediction forecasts which branch the software 
will require, based on the assumption that the previous 
branch that was taken will be used again. The Pentium 
processor makes branch predictions using a branch tar- 
get buffer (BTB) Unlike alternative architectures, this 
software-transparent innovation eliminates the need for 
recompiling code, thus increasing overall speed and ap- 
plication software performance. 


High-Performance Floating-Point Unit 


The emerging wave of 32-bit software applications in- 
cludes many compute-intensive, graphically oriented 
programs that require a high degree of floating-point 
processing power to handle mathematical calculations. 
As the floating-point requirements of personal comput- 
er software have steadily increased, advances in micro- 
processor technology have been introduced to satisfy 
these needs. The Intel486 DX CPU, for example, was 
the first Intel microprocessor to integrate math coproc- 
essing functions on-chip; previous-generation Intel 
processors used off-chip math coprocessors when float- 
ing-point calculations were required. 
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The Pentium processor takes math computational abili- 
ty to the next performance level by using an enhanced 
on-chip floating-point unit that incorporates sophisti- 
cated seven-stage pipelining and hardwired functions. 
A three-stage floating-point instruction pipeline is ap- 
pended to the integer pipelines. Most floating-point in- 
structions begin execution in one of the integer pipe- 
lines, then move on to the floating-point pipeline. Com- 
mon floating-point functions—such as add, multiply 
and divide—are hardwired for faster execution. 


As a result of these innovations, the Pentium processor 
executes floating-point instructions five to ten times 
faster than the 33-MHz Intel486 DX CPU, optimizing 
it for the high-speed numeric calculations inherent in 
advanced visual applications such as CAD and 3D 
graphics. 


Enhanced 64-Bit Data Bus 


The data bus is the highway that carries information 


between the processor and the memory subsystem. Be- 
cause it features a 64-bit data bus, the Pentium proces- 
sor has a substantially higher bus bandwidth than the 
Intel486 DX CPU—528 MBytes/sec at 66 MHz, com- 
pared to 160 MB/sec at 50 MHz for the Intel486 DX 
microprocessor. This wider data bus facilitates high- 
speed processing by maintaining the flow of instruc- 
tions and data to the processor’s superscalar execution 
unit, resulting in much higher overall performance for 
the Pentium processor when compared to the Intel486 
CPU. 


In addition to having a wider data bus, the Pentium 
processor implements bus cycle pipelining to increase 
bus bandwidth. Bus cycle pipelining allows a second 
cycle to start before the first one is completed. This 
gives the memory subsystem more time to decode the 
address, which allows slower and less-expensive memo- 
ry components to be used—resulting in a lower overall 
system cost. Burst reads and writes, parity on address 
and data, and a simple cycle identification all contrib- 
ute to providing better bandwidth and improved system 
reliability. 


Multiprocessing 


The Pentium processor is ideal for the increasing wave 
of multiprocessing systems and the greater levels of per- 
formance and scalability they provide for today’s com- 
puting arena. Multiprocessing applications that com- 
bine two or more Pentium processors are well served by 
the chip’s advanced architecture, separate on-chip code 
and data caches, chip sets for controlling external 
caches and sophisticated data integrity features. 
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As previously discussed, the Pentium prucessor main- 
tains cache consistency with its MESI protocol. When 
one processor accesses data that is cached in another 
processor, it is ensured of receiving the correct data. 
And if data is modified, all processors accessing that 
data receive the modified version. The new Intel Penti- 
um processor also ensures that instructions are seen by 
the system in the order that they were programmed. 
This strong ordering helps software designed to run on 
a single-processor system to work correctly in a multi- 
processing environment. 


Memory Page Size Option 


The Pentium processor offers the option of supporting 
either the traditional memory page size of 4 KBytes, or 
a larger, 4-MByte page. This optional feature was pro- 
vided to reduce the frequency of page swapping in com- 
plex graphics applications, frame buffers, and operating 
system kernels, where the increased page size now al- 
lows users to map large, previously unwieldy objects. 
An increased page hit rate results in higher perform- 
ance, all of which is transparent to the application soft- 
ware. 


Error Detection and Functional 
Redundancy 


Internal Error Detection and 
Functional Redundancy Check (FRC) 


Pentium™ Processor Pentium™ Processor 
Master Checker 


Outputs Inputs 
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Protecting important data and ensuring its integrity 
throughout the entire enterprise has become increasing- 
ly important as mission-critical applications continue to 
proliferate across today’s client/server environments. 
The Pentium processor incorporates two advanced fea- 
tures traditionally associated with mainframe-class de- 
signs—internal error detection and functional redun- 
dancy checking (FRC)—to help preserve data integrity 
in today’s evolving desktop-based networks. 
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Internal error detection places parity bits on the inter- 
nal code and data caches, translation look aside buffers, 
microcode, and branch target buffer, helping to detect 
errors in a manner that remains transparent to both the 
user and the system. Functional redundancy checking, 
meanwhile, is optimized for mission-critical applica- 
tions where two Pentium processors can operate in a» 
master/checker configuration. If a discrepancy is dis- 
covered between the two processors, the system is noti- 
fied. 


Performance Monitoring 


Performance monitoring is a feature of the Pentium 
processor that enables system designers and application 
developers to optimize their hardware and software 
products by identifying potential code bottlenecks. De- 
signers can observe and count clocks for internal proc- 
essor events that affect the performance of data reads 
and writes, cache hits and misses, interrupts, and bus 
utilization. This allows them to measure the effect that 
their code has on both the Pentium processor architec- 
ture and their product and to fine-tune their application 
or system for optimal performance. The benefit to end 
users is better value and higher performance, due to the 
greater synergy between the Pentium processor, its host 
system, and application software. 


Upgradability 


As with all new implementations of the Intel 32-bit mi- 
croprocessor architecture, the Pentium processor has 
been designed for easy upgradability using Intel’s up- 
grade technology. This innovation protects user invest- 
ments by adding performance that helps to maintain 
the productivity levels of Intel processor-based systems 
over their entire lifespans. 


Upgrade technology makes it possible for users to take 
advantage of more advanced processor technology in 
their existing systems with an easy-to-install, single- 
chip performance upgrade. For example Intel’s first up- 
grade options, the OverDrive™ processors developed 
for Intel486 SX and Intel486 DX CPUs, apply the 
same speed-doubling technology used in the develop- 
ment of the Intel486 DX2 microprocessor. 


By installing one of these upgrade processors in the 
socket located next to the CPU on many Intel486 proc- 
essor-based system motherboards, users can enhance 
overall system performance by as much as 70 percent 
across all software applications. And while the proces- 
sor’s speed is substantially increased, it still operates 
externally at its original clock rate, enabling it to inter- 
face seamlessly with the rest of the system and its asso- 
ciated peripherals. 
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Intel’s upgrade processors will also be available for sys- 
tems based on the Pentium processor family, ensuring 
an easy future upgrade path based on even more ad- 
vanced processor technology. In addition, Pentium 
processor technology will be the basis of the upgrade 
processor now being developed for Intel486 DX2 CPU- 
based systems. 


Increased Performance: By the 
Numbers 


The Pentium processor stands alone on the perform- 
ance ladder when compared to all other PC-compatible 
microprocessors, raising the standard for the Intel 32- 
bit architecture. While there are many ways to measure 
performance, three different examples offered here 
demonstrate the Pentium processor’s speed and pro- 
cessing power when compared to other alternatives: the 
SPECint92 and SPECfp92 UNIX benchmarks, and In- 
tel’s iCOMP™ index. 
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SPECint92 (Figure 1) is a processor-intensive UNIX 
benchmark that evaluates high-end desktop perform- 
mance using a representative mix of application in- 
structions. With a SPECint92 rating of 67.4, the 
66-MHz Pentium processor outperforms many work- 
station-class, RISC-based processors, including mem- 
bers of the IBM, MIPS and Sun SPARC processor 
families. 


The SPECfp92 UNIX benchmark (Figure 2) is a useful 
measure of floating-point performance. The 66-MHz 
Pentium processor’s SPECfp92 rating of 63.6 is compa- 
rable to that of today’s RISC architecture and more 
than 3.5 times that of the Intel486 DX2-66 CPU— 
which previously offered the highest floating-point per- 
formance of any processor compatible with the huge 
base of today’s industry-standard PC application soft- 
ware. 


SPECint92 Performance Comparison 
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Figure 1 


SPECfp92 UNIX Performance Comparison 
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The Intel ICOMP Index 


Pentium™ 
Processor-66 


Pentium. 
Processor-60 


1486™ DX2-66 
i486 DX-50 
1486 DX2-50 
i486 DX-33 
i486 SX-33 
1486 DX-25* 
i486 SX-25 
1486 SX-20 
i386™ DX-33 
1386 SX-33 
1386 DX-25 
i386 SL-25 
1386 SX-25 


1386 SX-20 


* The Intel486 SL CPU in a mobile configuration performs the 
same as the Intei486 DX CPU in a mobile configuration. 


Source: iCOMP™: A Simplified Measure of Relative Intel Microprocessor Performance 
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Figure 3 


Another indication of the Pentium processor’s high 
performance is provided by Intel’s iCOMP (Intel COm- 
parative Microprocessor Performance) index. The 
iCOMP index (Figure 3), which measures the perform- 
ance of the members of the Intel 32-bit architecture, 
was created so that computer users can more easily 
identify relative performance differences among Intel’s 
microprocessors. The index is based on four distinct 
aspects of CPU performance: integer, floating-point, 
graphics and video performance. Each of the four ele- 
ments is considered for both 16-bit and 32-bit software, 


Intel is a registered trademark of Intel Corporation. 


and each is weighted relative to the estimated percent- 
age of time it occupies the processor’s attention, based 
on a mix of today’s commonly used application soft- 
ware. 


Operating at 66 MHz, the Pentium processor has an 
iCOMP rating of 567. This relative performance is 
nearly twice that of the previous-generation high-end 
solution, the 66-MHz Intel486 DX2 microprocessor, 
which has an iCOMP rating of 297. 


iCOMP™, Intel386™, Intel486™, OverDrive™ and Pentium™ are trademarks of Intel Corporation. 
OS/2™ is a trademark of International Business Machines Corporation. 


Windows™ is a trademark of Microsoft Corporation. 
UNIX® is a registered trademark of UNIX System Labs. 
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athe m Increased Page Size 
@ 32-bit Microprocessor — 4M for Increased TLB Hit Rate 
— 32-Bit Addressing | 
— 64-Bit Data Bus m Multi-Processor Support 
S lar Avetiaet — Multiprocessor Instructions 
gpg sabierted tapine aecaenes — Support for Second Level Cache 
— Two Pipelined Integer Units 
— Capable of under One Clock per m@ Internal Error Detection 
Instruction | — Functional Redundancy Checking 
— Pipelined Floating Point Unit = “a i ict Test Se 
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— 8K Code, 8K Write Back Data @ IEEE 1149.1 Boundary Scan 
— 2-Way 32-Byte Line Size : Compatibility 
— Software Transparent m Performance Monitoring 
— MESI Cache Consistency Protocol — Counts Occurrence of Internal 
m Advanced Design Features Events 
— Branch Prediction — Traces Execution through Pipelines 


— Virtual Mode Extensions 


The Pentium processor provides the next generation of power for high-end workstations and servers. The 
Pentium processor is compatible with the entire installed base of applications for DOS, Windows, OS/2, and 
UNIX. The Pentium processor’s superscalar architecture can execute two instructions per clock cycle. Branch 
Prediction and separate caches also increase performance. The pipelined floating point unit of the Pentium 
processor delivers workstation level performance. Separate code and data caches reduce cache conflicts 
while remaining software transparent. The Pentium processor has 3.1 million transistors and is built on Intel’s 
0.8 Micron BiCMOS silicon technology. 


MS-DOS and Windows are registered trademarks of Microsoft Corporation. 
OS/2 is a trademark of International Business Machines Corporation. 
UNIX is a registered trademark of UNIX System Laboratories, Inc. 


November 1993 
Order Number: 241595-002 2-1 


Pentium™ Processor 


CONTENTS PAGE 
1.0 MICROPROCESSOR 
ARCHITECTURE OVERVIEW ......... 2-3 
BAPE asd vresswd condos sieves tes 2-6 
2.1 Pinout and Pin Descriptions ........ 2-6 
2.1.1 Pentium™ Processor 
PME sa Secon nek Wak ves s Ee. 2-6 
Oe LIORIONY INOROS: ccna oF ier ch «oeeeuey t 2-9 
2.3 Quick Pin Reference ............... 2-9 
2.4 Pin Reference Tables ............. 2-16 
2.5 Pin Grouping According to 
PUPRGHOID kick elegans annie aanetees 2-18 
2.6 Output Pin Grouping According to 
WIGUIDAVGR ociscaes i dx nke spats > 2-18 
2-2 


CONTENTS PAGE 
3.0 ELECTRICAL SPECIFICATIONS .... 2-18 
3.1. Power and Ground 0.355%. 42.08. 2-18 
3.2 Decoupling Recommendations .... 2-19 
3.3 Connection Specifications ........ 2-19 
3.4 Maximum Ratings ................ 2-19 
3.5 D.C. Specifications ............... 2-20 
3.6 A.C. Specifications ................ 2-20 
4.0 MECHANICAL SPECIFICATIONS ... 2-29 
5.0 THERMAL SPECIFICATIONS ....... 2-31 


intel. 


1.0 MICROPROCESSOR 
ARCHITECTURE OVERVIEW 


The Pentium™ processor is the next generation 
member of the Intel886™ and Intel486™ micro- 
processor family. It is 100% binary compatible with 
the 8086/88, 80286, Intel386 DX CPU, Intel886 SX 
CPU, Intel486 DX CPU, Intel486 SX and the Intel486 
DX2 CPUs. 


The Pentium processor contains all of the features 
of the Intel486 CPU, and provides significant en- 
hancements and additions including the following: 

e Superscalar Architecture 

e Dynamic Branch Prediction 

e Pipelined Floating-Point Unit 

e Improved Instruction Execution Time 

e Separate 8K Code and Data Caches 

e Writeback MESI Protocol in the Data Cache 

e 64-Bit Data Bus 

e Bus Cycle Pipelining 

e Address Parity 

e Internal Parity Checking 

e Functional Redundancy Checking 

e Execution Tracing 

e Performance Monitoring 

e [EEE 1149.1 Boundary Scan 

e System Management Mode 

e Virtual Mode Extensions 


The application instruction set of the Pentium proc- 
essor includes the complete Intel486 CPU instruc- 
tion set with extensions to accommodate some of 
the additional functionality of the Pentium processor. 
All application software written for the Intel886 and 
Intel486 microprocessors will run on the Pentium 
processor without modification. The on-chip memory 
management unit (MMU) is completely compatible 
with the Intel386 and Intel486 CPUs. 


The Pentium processor implements _ several 
enhancements to increase performance. The two 
instruction pipelines and floating-point unit on the 
Pentium processor are capable of independent op- 
eration. Each pipeline issues frequently used instruc- 
tions in a single clock. Together, the dual pipes can 
issue two integer instructions in one clock, or one 
floating point instruction (under certain circum- 
stances, 2 floating point instructions) in one clock. 
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Branch prediction is implemented in the Pentium 
processor. To support this, the Pentium processor 
implements two prefetch buffers, one to prefetch 
code in a linear fashion, and one that prefetches 
code according to the BTB so the needed code is 
almost always prefetched before it is needed for ex- 
ecution. 


The floating-point unit has been completely rede- 
signed over the Intel486 CPU. Faster algorithms pro- 
vide up to 10X speed-up for common operations in- 
cluding add, multiply, and load. 


The Pentium processor includes separate code and 
data caches integrated on chip to meet its perform- 
ance goals. Each cache is 8 Kbytes in size, with a 
32-byte line size and is 2-way set associative. Each 
cache has a dedicated Translation Lookaside Buffer 
(TLB) to translate linear addresses to physical ad- 
dresses. The data cache is configurable to be write- 
back or writethrough on a line by line basis and fol- 
lows the MESI protocol. The data cache tags are 
triple ported to support two data transfers and an 
inquire cycle in the same clock, The code cache is 
an inherently write protected cache. The code cache 
tags are also triple ported to support snooping and 
split line accesses. Individual pages can be config- 
ured as cacheable or non-cacheable by software or 
hardware. The caches can be enabled or disabled 
by software or hardware. 


The Pentium processor has increased the data bus 
to 64-bits to improve the data transfer rate. Burst 
read and burst writeback cycles are supported by 
the Pentium processor. In addition, bus cycle pipelin- 
ing has been added to allow two bus cycles to be in 
progress simultaneously. The Pentium processor 
Memory Management Unit contains optional exten- 
sions to the architecture which allow 4 Mbyte page 
sizes. 


The Pentium processor has added significant data 
integrity and error detection capability. Data parity 
checking is still supported on a byte by byte basis. 
Address parity checking, and internal parity checking 
features have been added along with a new excep- 
tion, the machine check exception. In addition, the 
Pentium processor has implemented functional re- 
dundancy checking to provide maximum error detec- 
tion of the processor and the interface to the proces- 
sor. When functional redundancy checking is used, a 
second processor, the “checker” is used to execute 
in lock step with the ‘master’ processor. The 
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checker samples the master’s outputs and com- 


pares those values with the values it computes inter-_ . 


nally, and asserts an error signal if a mismatch oc- 
curs. 


As more and more functions are integrated on chip, 
the complexity of board level testing is increased. To 
address this, the Pentium processor has increased 
test and debug capability. Like many of the Intel486 
CPUs, the Pentium processor implements IEEE 
Boundary Scan (Standard 1149.1). In addition, the 
Pentium processor has specified 4 breakpoint pins 
that correspond to each of the debug registers and 
externally indicate a breakpoint match. Execution 


— 


a : | 

intel. 
tracing provides external indications when an in- 
struction has completed execution in either of the 


two internal pipelines, or when a branch has been 
taken. 


System management mode has been implemented 
along with some extensions to the SMM architec- 
ture. Enhancements to the Virtual 8086 mode have 
been made to increase performance by reducing the 
number of times it is necessary to trap to a virtual 
8086 monitor. 


Figure 1-1 shows a block diagram of the Pentium 
processor. 
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Processor Block Diagram 
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The block diagram shows the two instruction pipe- 
lines, the “u” pipe and the “‘v” pipe. The u-pipe can 
execute all integer and floating point instructions. 
The v-pipe can execute simple integer instructions 
and the FXCH floating point instructions. 


The separate caches are shown, the code cache 
and data cache. The data cache has two ports, one 
for each of the two pipes (the tags are triple ported 
to allow simultaneous inquire cycles). The data 
cache has a dedicated Translation Lookaside Buffer 
(TLB) to translate linear addresses to the physical 
addresses used by the data cache. 


The code cache, branch target buffer and prefetch 
buffers are responsible for getting raw instructions 
into the execution units of the Pentium processor. 
Instructions are fetched from the code cache or 
from the external bus. Branch addresses are re- 
membered by the branch target buffer. The code 
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cache TLB translates linear addresses to physical 
addresses used by the code cache. 


The decode unit decodes the prefetched instruc- 
tions so the Pentium processor can execute the in- 
struction. The control ROM contains the microcode 
which controls the sequence of operations that must 
be performed to implement the Pentium processor 
architecture. The control ROM unit has direct control 
over both pipelines. 


The Pentium processor contains a pipelined floating 
point unit that provides a significant floating point 
performance advantage over previous generations 
of the Pentium processor. 


The architectural features introduced in this chapter 


are more fully described in the Pentium™ Processor 
User’s Manual. 
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2.1 Pinout and Pin Descriptions 


2.1.1 Pentium™ PROCESSOR PINOUT 
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Figure 2-1. Pentium™ Processor Pinout (Top View) 
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Figure 2-2. Pentium™ Processor Pinout (Bottom View) 
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Table 2-1. Pentium™ Processor Pin Cross Reference Table by Pin Name 


| Signal_| Location | 
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Table 2-1. Pentium™ Processor Pin Cross Reference Table by Pin Name (Continued) 
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2.2 Design Notes 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Unused active 
LOW inputs should be connected to Vcc. Unused 
active HIGH inputs should be connected to GND. 


No Connect (NC) pins must remain unconnected. 
Connection of NC pins may result in component fail- 
ure or incompatibility with processor steppings. 


NOTE: 

The No Connect pin located at LO3 (BRDYC#) 
along with BUSCHK# are sampled by the Pentium 
processor at RESET to configure the I/O buffers of 
the processor for use with the 82496 Cache Con- 
troller/82491 Cache SRAM secondary cache as.a 
chip set (refer to the 82496 Cache Controller/ 
82491 Cache SRAM Data Book for Use with the 
Pentium™ Processor for further information). 
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LO3, NO4, 
Q19, R19, 
$19, T18 


A04, AO5, 
A06, A07, 
A008, A11, 
A12, A13, 
A14, A15, 
A16, A17, 
A18, C01, 
DO1, E01, 
FOt; F21, 
G01, G21, 
HO1, J21 
K21, L214, 
M21, NO1, 
N21, P01, 
P21, Q01, 
Q18, Q21, 
R01, R21, 
$01, T01, 
U01, W06, 
W07, WO8, 
Wwo9, W10, 


BO5, BO6, 
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B07, BO8, | 


B11, B12, 
B13, B14, 
B15, B16, 
B17, B18, 
E02, F02, 
G02, G20, 
HO2, H20, 
J01, J20, 
K01, KO2, 
K20, LO1, 
L20, M01, 
M20, NO2, 
N20, P02, 
P20, Q02, 
Q20, R02, 
R20, S02, 
T02, VO7, 
V08, VO9, 
V10, V11, 
V12, V13, 
V14, V15, 
V16, V17, 
V1i8 


W11, W12, 
W113, W14, 
W15, W16, 
W17, W18 


2.3 Quick Pin Reference 


This section gives a brief functional description of 
each of the pins. For a detailed description, see the 
Hardware Interface chapter in the Pentium™ Proc- 
essor User’s Manual, Vol. 1. Note that all input 
pins must meet their AC/DC specifications to 
guarantee proper functional behavior. In this sec- 
tion, the pins are arranged in alphabetical order. The 
functional grouping of each pin is listed at the end of 
this chapter. 


The # symbol at the end of a signal name indicates 
that the active, or asserted state occurs when the 
signal is at a low voltage. When a # symbol is not 
present after the signal name, the signal is active, or 
asserted at the high voltage level. 


2-9 


PENTIUM™ PROCESSOR intel 
Table 2-2. Quick Pin Reference 


| Symbol __| Type’ Name and Function 
A20M# When the address bit 20 mask pin is asserted, the Pentium™ Processor 
emulates the address wraparound at one Mbyte which occurs on the 8086. 
When A20M # is asserted, the Pentium processor masks physical address bit 
20 (A20) before performing a lookup to the internal caches or driving a memory 
cycle on the bus. The effect of A20M # is undefined in protected mode. 
A20M# must be asserted only when the processor is in real mode. 
A31-A3 I/O 
define the physical area of memory or I/O accessed. The external system 
drives the inquire address to the processor on A31-—A5. 
ADS # , The address status indicates that a new valid bus cycle is currently being 
driven by the Pentium processor. 
In response to the assertion of address hold, the Pentium processor will stop 
driving the address lines (A31-—A3), and AP in the next clock. The rest of the 
bus will remain active so data can be returned or driven for previously issued 
bus cycles. 


As outputs, the address lines of the processor along with the byte enables 


Address parity is driven by the Pentium processor with even parity information 
on all Pentium processor generated cycles in the same clock that the address 
is driven. Even parity must be driven back to the Pentium processor during 
inquire cycles on this pin in the same clock as EADS # to ensure that the 
correct parity check status is indicated by the Pentium processor. 


The address parity check status pin is asserted two clocks after EADS # is 
sampled active if the Pentium processor has detected a parity error on the 
address bus during inquire cycles. APCHK # will remain active for one clock 
each time a parity error is detected. 


The byte enable pins are used to determine which bytes must be written to 
external memory, or which bytes were requested by the CPU for the current 
cycle. The byte enables are driven in the same clock as the address lines 

(A31-3). 


The backoff input is used to abort all outstanding bus: cycles that have not yet 
completed. In response to BOFF #, the Pentium processor will float all pins 
normally floated during bus hold in the next clock. The processor remains in 
bus hold until BOFF # is negated at which time the Pentium processor restarts 
the aborted bus cycle(s) in their entirety. 


The breakpoint pins (BP3-0) correspond to the debug registers, DR3—DRO. 
These pins externally indicate a breakpoint match when the debug registers 
are programmed to test for breakpoint matches. 


BP1 and BPO are multiplexed with the Performance Monitoring pins (PM1 and 
PMO). The PB1 and PBO bits in the Debug Mode Control Register determine if 
the pins are configured as breakpoint or performance monitoring pins. The pins 
come out of reset configured for performance monitoring. 


PM/BP[1:0] 


The burst ready input indicates that the external system has presented valid 
data on the data pins in response to a.read or that the external system has 
accepted the Pentium processor data in response to a write request. This 
signal is sampled in the T2, T12 and T2P bus states. 


The bus request output indicates to the external system that the Pentium 
processor has internally generated a bus request. This signal is always driven 
whether or not the Pentium processor is driving its bus. 


The branch trace outputs provide bits 2-0 of the branch target linear address 
(BT2-BTO) and the default operand size (BT3) during a branch trace message 
special cycle. 
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Table 2-2. Quick Pin Reference (Continued) 


[moor [tyes] Namwandrunction 


BUSCHK # The bus check input allows the system to signal an unsuccessful completion of a 
bus cycle. If this pin is sampled active, the Pentium processor will latch the 
CACHE # ae 


address and control signals in the machine check registers. If in addition, the MCE 
bit in CR4 is set, the Pentium processor will vector to the machine check 
exception. 


For Pentium processor-initiated cycles the cache pin indicates internal 
cacheability of the cycle (if a read), and indicates a burst writeback cycle (if a 
write). If this pin is driven inactive during a read cycle, Pentium processor will not 
cache the returned data, regardless of the state of the KEN # pin. This pin is also 
used to determine the cycle length (number of transfers in the cycle). 


The clock input provides the fundamental timing for the Pentium processor. Its 
frequency is the internal operating frequency of the Pentium processor and 

requires TTL levels. All external timing parameters except TDI, TDO, TMS and 
TRST # are specified with respect to the rising edge of CLK. 


The Data/Code output is one of the primary bus cycle definition pins. It is driven 
valid in the same clock as the ADS # signal is asserted. D/C # distinguishes 
between data and code or special cycles. 


These are the 64 data lines for the processor. Lines D7-D0 define the least 
significant byte of the data bus; lines D63-D56 define the most significant byte of 
the data bus. When the CPU is driving the data lines, they are driven during the 
T2, T12, or T2P clocks for that cycle. During reads, the CPU samples the data bus 
when BRDY # is returned. 


These are the data parity pins for the processor. There is one for each byte of the 
data bus. They are driven by the Pentium processor with even parity information 
on writes in the same clock as write data. Even parity information must be driven 
back to the Pentium processor on these pins in the same clock as the data to 
ensure that the correct parity check status is indicated by the Pentium processor. 
DP7 applies to D63-—D56, DPO applies to D7-D0. 


This signal indicates that a valid external address has been driven onto the 
Pentium processor address pins to be used for an inquire cycle. 


The external write buffer empty input, when inactive (high), indicates that a write 
cycle is pending in the external system. When the Pentium processor generates a 
write, and EWBE # is sampled inactive, the Pentium processor will hold off all 
subsequent writes to all E or M-state lines in the data cache until all write cycles 
have completed, as indicated by EWBE # being active. 


The floating point error pin is driven active when an unmasked floating point error 
occurs. FERR # is similar to the ERROR # pin on the Intel3887™ math 
coprocessor. FERR # is included for compatibility with systems using DOS type 
floating point error reporting. 


When asserted, the cache flush input forces the Pentium processor to writeback 
all modified lines in the data cache and invalidate its internal caches. A Flush 
Acknowledge special cycle will be generated by the Pentium processor indicating 
completion of the writeback and invalidation. 


If FLUSH # is sampled low when RESET transitions from high to low, tristate test 
mode is entered. | 
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Table 2-2. Quick Pin Reference (Continued) 


| Symbol | Type* | Name and Function 


FRCMC# ss The Functional Redundancy Checking Master/Checker mode input is used to 


determine whether the Pentium processor is configured in master mode or checker 
mode. When configured as a master, the Pentium processor drives its output pins 

ue 

co a 


as required by the bus protocol. When configured as a checker, the Pentium 
processor tristates all outputs (except IERR# and TDO) and samples the output 
: EB 
‘s fae 
The instruction branch taken pin is driven active (high) for one clock to indicate that 
a branch was taken. This output is always driven by the Pentium processor. 


pins. 

clits # The internal error pin is used to indicate two types of errors, internal parity errors 
and functional redundancy errors. If a parity error occurs on a read from an internal 
array, the Pentium processor will assert the IERR # pin for one clock and then 
shutdown. If the Pentium processor is configured as a checker and a mismatch 
occurs between the value sampled on the pins and the corresponding value 


The Ait indication is driven to reflect the outcome of an inquire cycle. If an inquire 
cycle hits a valid line in either the Pentium processor data or instruction cache, this 
pin is asserted two clocks after EADS # is sampled asserted. If the inquire cycle 
misses Pentium processor cache, this pin is negated two clocks after EADS #. This 
pin changes its value only as a result of an inquire cycle and retains its value 
between the cycles. 


The hit to a modified line output is driven to reflect the outcome of an inquire cycle. 
It is asserted after inquire cycles which resulted in a hit to a modified line in the data 
cache. It is used to inhibit another bus master from accessing the data until the line 
is completely written back. 


The bus hold acknowledge pin goes active in response to a hold request driven to 
the processor on the HOLD pin. It indicates that the Pentium processor has floated 
most of the output pins and relinquished the bus to another local bus master. When 
leaving bus hold, HLDA will be driven inactive and the Pentium processor will 
resume driving the bus. If the Pentium processor has bus cycle pending, it will be 
driven in the same clock that HLDA is deasserted. 


In response to the bus hold request, the Pentium processor will float most of its 
output and input/output pins and assert HLDA after completing all outstanding bus 
cycles. The Pentium processor will maintain its bus in this state until HOLD is 
deasserted. HOLD is not recognized during LOCK cycles. the Pentium processor 
will recognize HOLD during reset. 


The configuration as a master/checker is set after RESET and may not be 
changed other than by a subsequent RESET. 

computed internally, the Pentium processor will assert IERR # two clocks after the 
mismatched value is returned. 


IGNNE# This is the ignore numeric error input. This pin has no effect when the NE bit in CRO 
is set to 1. When the CRO.NE bit is 0, and the IGNNE # pin is asserted, the Pentium 
processor will ignore any pending unmasked numeric exception and continue 
executing floating point instructions for the entire duration that this pin is asserted. 
When the CRO.NE bit is 0, IGNNE # is not asserted, a pending unmasked numeric 
exception exists (SW.ES = 1), and the floating point instruction is one of FINIT, 
FCLEX, FSTENV, FSAVE, FSTSW, FSTCW, FENI, FDISI, or FSETPM, the Pentium 
processor will execute the instruction in spite of the pending exception. When the 


CRO.NE bit is 0, IGNNE # is not asserted, a pending unmasked numeric exception 
exists (SW.ES = 1), and the floating point instruction is one other than FINIT, 
FCLEX, FSTENV, FSAVE, FSTSW, FSTCW, FENI, FDISI, or FSETPM, the Pentium 
processor will stop execution and wait for an external interrupt. 
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Table 2-2. Quick Pin Reference (Continued) 


| Symbol | Type* | Name and Function 


The Pentium processor initialization input pin forces the Pentium processor to begin 
execution in a known state. The processor state after INIT is the same as the state _ 
after RESET except that the internal caches, write buffers, and floating point 
registers retain the values they had prior to INIT. INIT may NOT be used in lieu of 
RESET after power-up. 


If INIT is sampled high when RESET transitions from high to low the Pentium 


processor will perform built-in self test prior to the start of program execution. 


An active maskable interrupt input indicates that an external interrupt has been 
generated. If the IF bit in the EFLAGS register is set, the Pentium processor will 
generate two locked interrupt acknowledge bus cycles and vector to an interrupt 
handler after the current instruction execution is completed. INTR must remain active 
until the first interrupt acknowledge cycle is generated to assure that the interrupt is 
recognized. 


The invalidation input determines the final cache line state (S or |) in case of an 
inquire cycle hit. It is sampled together with the address for the inquire cycle in the 
clock EADS # is sampled active. 


The u-pipe instruction complete output is driven active (high) for 1 clock to indicate 
that an instruction in the u-pipeline has completed execution. This pin is always 
driven by the Pentium processor. 


The v-pipe instruction complete output is driven active (high) for one clock to indicate 
that an instruction in the v-pipeline has completed execution. This pin is always 
driven by the Pentium processor. 


The cache enable pin is used to determine whether the current cycle is cacheable or 
not and is consequently used to determine cycle length. When the Pentium 
processor generates a cycle that can be cached (CACHE # asserted) and KEN # is 
active, the cycle will be transformed into a burst line fill cycle. 


The bus /ock pin indicates that the current bus cycle is locked. The Pentium 
processor will not allow a bus hold when LOCK # is asserted (but AHOLD and 
BOFF # are allowed). LOCK # goes active in the first clock of the first locked bus 
cycle and goes inactive after the BRDY # is returned for the last locked bus cycle. 
LOCK # is guaranteed to be deasserted for at least one clock between back to back 
locked cycles. 


The Memory/Input-Output is one of the primary bus cycle definition pins. It is driven 
valid in the same clock as the ADS # signal is asserted. M/IO# distinguishes 
between memory and I/O cycles. 


An active next address input indicates that the external memory system is ready to 
accept a new bus cycle although all data transfers for the current cycle have not yet 
completed. The Pentium processor will drive out a pending cycle two clocks after 

NA # is asserted. The Pentium processor supports up to 2 outstanding bus cycles. 


The non-maskable interrupt request signal indicates that an external non-maskable 
interrupt has been generated. 


The page cache disable pin reflects the state of the PCD bit in CR3, the Page 
Directory Entry, or the Page Table Entry. The purpose of PCD is to provide an 
external cacheability indication on a page by page basis. 


The parity check output indicates the result of a parity check on a data read. It is 
driven with parity status two clocks after BRDY # is returned. PCHK # remains low 

one clock for each clock in which a parity error was detected. Parity is checked only 
for the bytes on which valid data is returned. 
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Table 2-2. Quick Pin Reference (Continued) 


| Symbol | Type*_ Name and Function | 


The parity enable input (along with CR4.MCE) determines whether a machine 
check exception will be taken as a result of a data parity error on a read cycle. 
If this pin is sampled active in the clock a data parity error is detected, the 
Pentium processor will latch the address and control signals of the cycle with 
the parity error in the machine check registers. If in addition the machine check 
enable bit in CR4 is set to “1”, the Pentium processor will vector to the 
machine check exception before the beginning of the next instruction. 


PM/BP[1:0]B 


These pins function as part of the Performance Monitoring feature. 


The breakpoint pins BP[1:0] are multiplexed with the Performance Monitoring 
pins PM[1:0]. The PB1 and PBO bits in the Debug Mode Control Register 

determine if the pins are configured as breakpoint or performance monitoring 
pins. The pins come out of reset configured for performance monitoring. 


The PRDY output pin indicates that the processor has stopped normal 
execution in response to the R/S# pin going active, or Probe Mode being 
entered. 


The page write through pin reflects the state of the PWT bit in CR3, the Page 
Directory Entry, or the Page Table Entry. The PWT pin is used to provide an 
external writeback indication on a page by page basis. 


The R/S # input is an asynchronous, edge sensitive interrupt used to stop the 
normal execution of the processor and place it into an idle state. A high to low 
transition on the R/S# pin will interrupt the processor and cause it to stop 

execution at the next instruction boundary. 


RESET Reset forces the Pentium processor to begin execution at a known state. All 
the Pentium processor internal caches will be invalidated upon the RESET. 


Modified lines in the data cache are not written back. 


FLUSH #, FRCMC# and INIT are sampled when RESET transitions from high 
to low to determine if tristate test mode or checker mode will be entered, or if 
BIST will be run. 


The split cycle output is asserted during misaligned LOCKed transfers to 
indicate that more than two cycles will be locked together. This signal is 
defined for locked cycles only. It is undefined for cycles which are not locked. 


SCYC 


SMI # The system Management Interrupt causes a system management interrupt 
request to be latched internally. When the latched SMI # is recognized on an 


instruction boundary, the processor enters System Management Mode. 


An active system management interrupt active output indicates that the 
processor is operating in System Management Mode (SMM). 


The testability clock input provides the clocking function for the Pentium 
processor boundary scan in accordance with the IEEE Boundary Scan 

interface (Standard 1149.1). It is used to clock state information and data into 
and out of the Pentium processor during boundary scan. 


SMIACT # 


BS) Uv Uv 
ee) ee 
< 


TCK 


The test data input is a serial input for the test logic. TAP instructions and data 
are shifted into the Pentium processor on the TDI pin on the rising edge of TCK 
when the TAP controller is in an appropriate state. 


TDI 
The test data output is a serial output of the test logic. TAP instructions and 
data are shifted out of the Pentium processor on the TDO pin on the falling 


~*TDO 
edge of TCK when the TAP controller is in an appropriate state. 
TMS The value of the fest mode select input signal sampled at the rising edge of 
TCK controls the sequence of TAP controller state changes. 
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Table 2-2. Quick Pin Reference (Continued) 


| Symbol | Type* : Name and Function 


TRST # When asserted, the fest reset input allows the TAP controller to be 
asynchronously initialized. , 


W/R# Write/Read is one of the primary bus cycle definition pins. It is driven valid in the 
same clock as the ADS # signal is asserted. W/R# distinguishes between write 
and read cycles. 


WB/WT # The writeback/writethrough input allows a data cache line to be defined as write 
back or write through on a line by line basis. As a result, it determines whether a 
cache line is initially in the S or E state in the data cache. 


NOTE: 
*The pins are classified as Input or Output based on their function in Master Mode. See the Functional Redundancy Check- 
ing section in the “Error Detection” chapter of the Pentium™ Processor User’s Manual, Vol. 1, for further information. 
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2.4 Pin Reference Tables 
Table 2-3. Output Pins 


PM1/BP1, 
PM0/BPO 


Bus Hold, BOFF # 


Bus Hold, BOFF # 


All states except 
Shift-DR and Shift-iR 


NOTE: 
All output and input/output pins are floated during tri- 
state test mode and checker mode (except IERR #). 
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Table 2-4. Input Pins 
Active; Synchronous/ | Internal Qualified 
Level | Asynchronous | resistor; | 


Avo. [wan [Seon | | 


porr* [Low [synchronous | | 


a ie cman ee oa 


State T2, 
BuscHKs |LOW |Synchronous | Pullup_| BROY#_ 
He ree ndeiy 


112, fer 
n 


/a 
EADS # Synchronous 


EWBE # Synchronous 
FLUSH # Asynchronous 
FRCMC# Asynchronous 


.@) 
e 
A 


= 
O 
a 
Oo 
ae 
G) 
be 


Synchronous 


Asynchronous 
INIT Asynchronous 
INTR Asynchronous 
INV Synchronous 


LOW Synchronous 


HIGH Asynchronous 

Synchronous 
R/S# Asynchronous 
RESET HIGH | Asynchronous 

Asynchronous 


TDI Synchronous/TCK 
TMS Synchronous/TCK 


Asynchronous 


/WT# Synchronous 


EADS# 
EWBE# 
FLUSH# 
FROMC# 
HOLD 
IGNNE# 
at 
NTR | 
weak 
LOW | Synchronous 
NMI 
PEN# 
Rise 
RESET 
SMe 
TOK 
sa 
™s | 


= 
wo 
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Table 2-5. Input/Output Pins 


Qualified 
Active Level When Floated (when an input) 


| 
/a 


Bus Hold, BOFF # BRDY # 
DP7-DPO Bus Hold, BOFF # BRDY # 


NOTE: 


All output and input/output pins are floated during tristate test mode (except TDO) and checker mode (except IERR# and 
TDO). 
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2.5 Pin Grouping According to 
Function : 


Table 2-6 organizes the pins with respect to their 
function. 


Table 2-6. Pin Functional Grouping 


DP7-DPO, PCHK#, 
PEN# 


M/IO#, D/C#, W/R#, 
CACHE #, SCYC, 
LOCK # 


Consistency | HITM#, INV 


Bus Arbitration BOFF #, BREQ, HOLD, 
HLDA 


Floating Point Error | FERR#,|IGNNE# 
Reporting 

System SMI #, SMIACT # 
Management Mode 


Functional FRCMC # (IERR#) 
Redundancy 
Checking 


TAP Port TCK, TMS, TDI, TDO, 
TRST # 


Breakpoint/ PM0O/BPO, PM1/BP1, 
Performance BP3-2 
Monitoring 


BT3-BTO, IU, IV, IBT 
Probe Mode R/S#, PRDY 
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2.6 Output Pin Grouping According to 
when Driven 


This section groups the output pins according to 
when they are driven. 


Group 1 


The following output pins are driven active at the 
beginning of a bus cycle with ADS#. A31-A3 and 
AP are guaranteed to remain valid until AHOLD is 
asserted or until the earlier of the clock after NA# or 
the last BRDY #. The remaining pins are guaranteed 
to remain valid until the earlier of the clock after 
NA# or the last BRDY #: 


A31-A3, AP, BE7#-0#, CACHE#, 
W/R#, D/C#, SCYC, PWT, PCD. 


M/IO#, 


Group 2 


As outputs, the following pins are driven in T2, T12, 
and T2P. As inputs, these pins are sampled with 
BRDY #: 


D63-—0, DP7-0. 


Group 3 


These are the status output pins. They are always 
driven: 


BREQ, HIT#, HITM#, IU, IV, IBT, BT3-BTO, 
PMO/BPO, PM1/BP1, BP3, BP2, PRDY, SMIACT #. 
Group 4 


These are the glitch free status output pins. 


APCHK#, 
PCHK#. 


FERR#, HLDA, IERR#, LOCK#, 


3.0 ELECTRICAL SPECIFICATIONS 


3.1 Power and Ground 


For clean on-chip power distribution, the Pentium 
processor has 50 Vcc (power) and 49 Vss (ground) 
inputs. Power and ground connections must be 
made to all external Vcc and Vss pins of the Penti- 
um processor. On the circuit board, all Vcc pins 
must be connected to a Vcc plane. All Vss pins 
must be connected to a Vssg plane. 
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3.2 Decoupling Recommendations 


Liberal decoupling capacitance should be placed 
near the Pentium processor. The Pentium processor 
driving its large address and data buses at high fre- 
quencies can cause transient power surges, particu- 
larly when driving large capacitive loads. 


Low inductance capacitors (i.e. surface mount ca- 
pacitors) and interconnects are recommended for 
best high frequency electrical performance. Induc- 
tance can be reduced by connecting capacitors di- 
rectly to the Vcc and Vss planes, with minimal trace 
length between the component pads and vias to the 
plane. Capacitors specifically for PGA packages are 
also commercially available. 


These capacitors should be evenly distributed 
among each component. Capacitor values should 
be chosen to ensure they eliminate both low and 
high frequency noise components. 


3.3 Connection Specifications 


All NC pins must remain unconnected. 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Unused active 
low inputs should be connected to Voc. Unused ac- 
tive high inputs should be connected to ground. 
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3.4 Maximum Ratings 


Table 3-1 is a stress rating only. Functional opera- 
tion at the maximums is not guaranteed. Functional 
operating conditions are given in the A.C. and D.C. 
specification tables. 


Extended exposure to the maximum ratings may af- 
fect device reliability. Furthermore, although the 
Pentium processor contains protective circuitry to 
resist damage from static electric discharge, always 
take precautions to avoid high static voltages or 
electric fields. 


Table 3-1. Absolute Maximum Ratings 


Case temperature | —65°C to + 110°C 
under bias 

Storage —65°C to + 150°C 
temperature 


Voltage on any 
pin with respect 
to ground 


Supply voltage —0.5V to +6.5V 
with respect to 
Vss 


—0.5 Vcc to Voc + 0.5 (V) 
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3.5 D.C. Specifications 


Table 3-2 lists the D.C. specifications associated with the Pentium processor. 


Table 3-2. Pentium™ Processor D.C. Specifications Voc = See Notes 10, 11; Tease = See Notes 12, 13 


[symbol | Parameter 
in _—_| tputow Votags 
in | input igh Votags 
Tor | Output ow Voiags 
Tow | Outnut 

re. 
oe 


Power Supply Current 


Input Leakage Current 
Output Leakage Current 
Input Leakage Current 


i 


Input Leakage Current 


Input Capacitance 
x: Sa Output Capacitance 
1/O Capacitance 


Pig = 
ae 
on emer 
ee 

patta.'| 

Cox | CLK inputCapacitance | 
ae 

aes 

Lie 


ViL 
ViIH 
VoL 
VOH 
loc 
lu 
ILo 
HL 
NH 
CIN 
Co 


CTIN Test Input Capacitance 
Test Output Capacitance 
Test Clock Capacitance 


NOTES: 

. Parameter measured at 4 mA load. 

. Parameter measured at 1 mA load. 

. This parameter is for input without pullup or pulldown. 
. This parameter is for input with pullup. 

. This parameter is for input with pulldown. 

. Worst case average Icc for a mix of test patterns. 


ONOoah hn — 


| Min 
| =03 
| 20 
2S 
Output High Voltage | 24 | 
eae. 
Ere 
aaaed 


Unit 
i. Se 
Bf WAN 
ae 
aN 
2910 A 60 MHz, (7), (9) 
| uA 
uA 
| uA 
| uA 


ae Be 
Be ie 


i eee ee 


. (16W max.) Typical Pentium processor supply current is 2600 mA (13W) at 66 MHz. 


9. (14.6W max.) Typical Pentium processor supply current is 2370 mA (11.9W) at 60 MHz. 


10. Voc = 5V + 5% at 60 MHz. 

11. Voc = 4.9V to 5.40V at 66 MHz. 
12. Tcase = O°C to +80°C at 60 MHz. 
13. Tcase = O°C to + 70°C at 66 MHz. 


3.6 A.C. Specifications 


The 66 MHz and 60 MHz A.C. specifications given in 
Tables 3-3 and 3-4 consist of output delays, input 
setup requirements and input hold requirements. All 
A.C. specifications (with the exception of those for 
the TAP signals) are relative to the rising edge of the 
CLK input. 


All timings are referenced to 1.5 volts for both ‘‘O” 
and ‘1” logic levels unless otherwise specified. 
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Within the sampling window, a synchronous input 
must be stable for correct Pentium processor opera- 
tion. 


Care should be taken to read all notes associated 
with a particular timing parameter. In addition, the 
following list of notes apply to the timing specifica- 
tion tables in general and are not associated with 
any one timing. They are 2, 5, 6, and 14. 
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Table 3-3. 66 MHz Pentium™ Processor A.C. Specifications 
Voc = 4.9V to 5.40V; Tease = 0°C to + 70°C; C; = O pF 


Parameter 


or eae 
(18), (19), (20), (21) 
@2V, (1) 


CLK Rise Time 0.15 


ADS #, A3-A31, BTO-3, PWT, 
PCD, BEO-7#, M/IO#, D/C#, 
W/R#, CACHE#, SCYC, LOCK # 
Valid Delay 


AP Valid Delay 


ADS #, AP, A3—-A31, BTO-3, 
PWT, PCD, BEO-7#, M/IO#, 
D/C#, W/R#, CACHE#, SCYC, 
LOCK # Float Delay 


PCHK #, APCHK#, IERR#, 
FERR# Valid Delay 

BREQ, HLDA, SMIACT # Valid 
Delay 


HIT #, HITM# Valid Delay 


PMO-1, BPO-3, IU, IV, IBT Valid 
Delay 


. la PRDY Valid Delay 


DO-D63, DPO-7 Write Data Valid 
Delay 

DO-63, DPO-7 Write Data Float 
Delay 


KEN, WB/WT #, NA# Hold 
Time 
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Table 3-3. 66 MHz Pentium™ Processor A.C. Specifications 
Voc = 4.9V to 5.40V; Tcase = 0°C to + 70°C; CL = 0 pF (Continued) 


|Symbol | Parameter | Min 
BUSCHK#, EWBE #, HOLD, PEN# 5 
Setup Time 
BUSCHK #, EWBE #, HOLD, PEN# 15 
Hold Time . 
/ tag | A20M#, INTR, Setup Time ae ae 
| tz7 | A20M#, INTR, Hold Time 
INIT, FLUSH#, NMI, SMI#, IGNNE# 
Setup Time 
INIT, FLUSH #, NMI, SMI#, IGNNE # 1.5 
Hold Time 
INIT, FLUSH#, NMI, SMI#, IGNNE # 2 
Pulse Width, Async 


(12), (16) 


( 


(15), (17) 


(12), (16), (17) 
(15), (17) 


R/S# Setup Time 
R/S# Hold Time 


R/S# Pulse Width, Async 


ies oye DO-D63 Read Data Setup Time 


(11), (12), (16) 


(11), (13) 


Ties | 00-069, DPO-7 Read Data HoldTime | 2 
Oe a ee 
Tey | RESETHoTIme ——SSS«*d; CS 
Pies | RESET Pulse Width, Voo& GLK Stabe | 15 _ 
east 


tao Pentium processor Reset Configuration 
Signals (INIT, FLUSH #, FRCMC#) 
Setup Time 


Pentium processor Reset Configuration 
Signals (INIT, FLUSH #, FRCMC #) Hold 
Time 


Pentium processor Reset Configuration 
Signals (INIT, FLUSH #, FRCMC #) 
Setup Time, Async. 


Pentium processor Reset Configuration 


Signals (INIT, FLUSH #, FRCMC#) Hold 
Time, Async. , 


TCK Frequency 
TCK Period 


eS 
me 
a 
ras 
ihe 
ee 
i 
Teun | DPO-7 Read Data Seup tine | se | 
oe 
Saas 
eas 
co 
ee 
6 
ee 
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Table 3-3. 66 MHz Pentium™ Processor A.C. Specifications 
Voc = 4.9V to 5.40V; Toease = 0°C to + 70°C; CL = O pF (Continued) 


[Parameter | win | Max 
TroKrigntime ia 
rok towTime ida 
sae 
as 


[rigwe | Notes 


| Unit | 

ns 

pins 
pons 
ns 
/ ns 
prs | 36 
eee Ea 
prs | 36 
prs | 36 
ns _| 
ns 
_ ns 
_ns_| 


| Symbol 

igh T ba 
i es 
[tue | TCKFallT ss 
ea 
[so | TRST#Pulsewiath =| 40 | 
‘ts; | TOL TMSSetuptime | 5 | 
[so | TDL TMSHoldTime | 13 | 
[iss | TOOValidDelay | 8 | 20 
sq | TOOFloatDelay || 8 
ss | AllNon-Test Outputs Valid Delay | 3 | 20 | 
sg __| AllNon-Test Outputs FloatDelay |_| 25 | ns | 36 | (1),(3),(8)(10) 
All Non-Test Inputs Setup Time at ee Se | ONO S| R53) 
ray rit [re | 36 [Om 
NOTES: 


1. Not 100% tested. Guaranteed by design/characterization. 

2. TTL input test waveforms are assumed to be 0 to 3 Volt transitions with 1Volt/ns rise and fall times. 

3. Non-Test Outputs and Inputs are the normal output or input signals (besides TCK, TRST#, TDI, TDO, and TMS). These 
timings correspond to the response of these signals due to boundary scan operations. 

4. APCHK#, FERR#, HLDA, IERR#, LOCK#, and PCHK# are glitch free outputs. Glitch free signals monotonically tran- 
sition without false transitions (i.e. glitches). 

5. 0.8 V/ns < CLK input rise/fall time < 8 V/ns. 

6. 0.3 V/ns < Input rise/fall time < 5 V/ns. 

7. Referenced to TCK rising edge. 

8. Referenced to TCK falling edge. 

9. Ins can be added to the maximum TCK rise and fall times for every 10 MHz of frequency below 16 MHz. 

10. During probe mode operation, use the normal specified timings. Do not use the boundary scan timings (ts55.58). 

11. FRCMC# should be tied to Vcc (high) to ensure proper operation of the Pentium processor as a master Pentium 
processor. 

12. Setup time is required to guarantee recognition on a specific clock. 

13. Hold time is required to guarantee recognition on a specific clock. 

14. All TTL timings are referenced from 1.5 V. 

15. To guarantee proper asynchronous recognition, the signal must have been deasserted (inactive) for a minimum of 2 
clocks before being returned active and must meet the minimum pulse width. 

16. This input may be driven asynchronously. 

17. When driven asynchronously, NMI, FLUSH#, R/S#, INIT, and SMI# must be deasserted (inactive) for a minimum of 2 
clocks before being returned active. 

18. Functionality is guaranteed by design/characterization. 

19. Measured on rising edge of adjacent CLKs at 1.5V. 

20. To ensure a 1:1 relationship between the amplitude of the input jitter and the internal and external clocks, the jitter 
frequency spectrum should not have any power spectrum peaking between 500 KHz and 1 of the CLK operating frequency. 
21. The amount of jitter present must be accounted for as a component of CLK skew between devices. 
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All Non-Test Inputs Hold Time 
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Table 3-4. 60 MHz Pentium™ Processor A.C. Specifications 
Voc = 5V +5%; Tease = 0°C to + 80°C; CL = 0 pF 


ADS #, A3-A31, BTO-3, PWT, PCD, 
BEO-7#, M/IO#, D/C#, W/R#, 
CACHE #, SCYC, LOCK # Valid Delay 


ADS #, AP, A3-A31, BTO-3, PWT, 
PCD, BEO-7#, M/IO#, D/C#, W/R#, 
CACHE #, SCYC, LOCK # Float Delay 


Cal 


tia 


ti8a 
t 


t 


tio 
ty 
ti2 
t14 
t15 
ti6 
t17 
tig 

19 

21 

24 


t 


BUSCHK#, EWBE#, HOLD, PEN# 

Setup Time 

BUSCHK#, EWBE#, HOLD, PEN# 15 
Hold Time 
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Table 3-4. 60 MHz Pentium™ Processor A.C. Specifications 
Voc = 5V +5%; Tcase = 0°C to + 80°C; C, = 0 pF (Continued) 


A20M#, INTR, Setup Time 


t 
t 


A20M+#, INTR, Hold Time es 
INIT, FLUSH#, NMI, SMI#, IGNNE # 
Setup Time 
INIT, FLUSH#, NMI, SMI#, IGNNE # 

Hold Time 


R/S# Setup Time 
R/S# Hold Time 


| Min 
| 55 
55 
p15 i aes 
ee a ee 
ee ee es 
A in fe MB BAY ee one 
Se he Pe ee ee ee 
, 55 | | ns | 35 | (14), (12),(16) 
|RESETHodTime | 5 || ons || ct) 
| RESET Pulse Width, Voc&CLKStable | 15 | | ciks | 36 | 1) 
RESET Active after Voo &CLK Stable | 1 
eae 
62.5 
| 25 
| 25 


t 
t 
t 


arate 4 
Path 
wens 
| D0-DE3ReadDataSetupTime | 43 | 
-DPO-7ReadDataSetupTime | 43 | 
ee 


DO-—D63, DPO-—7 Read Data Hold Time 
RESET Setup Time 


27 
28 
30 
31 
34 
37 


INIT, FLUSH #, NMI, SMI#, IGNNE# 
Pulse Width, Async 


t 


R/S# Pulse Width, Async 


Pentium Processor Reset 
Configuration Signals (INIT, FLUSH#, 
FRCMC #) Setup Time 


Pentium Processor Reset 
Configuration Signals (INIT, FLUSH #, 
FRCMC #) Hold Time 


Pentium Processor Reset 
Configuration Signals (INIT, FLUSH#, 
FRCMC #) Setup Time, Async. 


Pentium Processor Reset 
Configuration Signals (INIT, FLUSH#, 
FRCMC #) Hold Time, Async. 


TCK Frequency 


lana 
oS 
—_ 


TCK Period 
TCK High Time 
TCK Low Time 


TCK Fall Time 
TCK Rise Time 


t45 
t47 
49 


t 


(2.0V-0.8V), 
(1), (8), (9) 
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Table 3-4. 60 MHz Pentium™ Processor A.C. Specifications 
Voc = 5V +5%; Tcase = 0°C to + 80°C; C, = 0 pF (Continued) 


[Parameter | in 
TOL TMS SetupTime +t 8 
rE 


| Symbol 

| eo. HS 

ez 8 
[ss | TDOValidDela | 8 
[tsa | TDOFloatDelay | | 36 

ss | _AllNon-Test Outputs Valid Delay | 3 | 36 | @(@).(10) 
tse | AllNon-Test Outputs Float Delay | 36 | (1), (10) 
ae 36 

pti ese Beas 


All Non-Test Inputs Setup Time 


All Non-Test Inputs Hold Time 


(3), (7), (10) 
(3), (7), (10) 


NOTES: 

1. Not 100% tested. Guaranteed by design/characterization. 

2. TTL input test waveforms are assumed to be 0 to 3 Volt transitions with 1Volt/ns rise and fall times. 

3. Non-Test Outputs and Inputs are the normal output or input signals (besides TCK, TRST#, TDI, TDO, and TMS). These 
timings correspond to the response of these signals due to boundary scan operations. 

4. APCHK#, FERR#, HLDA, IERR#, LOCK#, and PCHK# are glitch free outputs. Glitch free signals monotonically tran- 
sition without false transitions (i.e. glitches). 

5. 0.8 V/ns < CLK input rise/fall time < 8 V/ns. 

6. 0.3 V/ns < Input rise/fall time < 5 V/ns. 

7. Referenced to TCK rising edge. 

8. Referenced to TCK falling edge. 

9. 1ns can be added to the maximum TCK rise and fall times for every 10 MHz of frequency below 16 MHz. 

10. During probe mode operation, use the normal specified timings. Do not use the boundary scan timings (ts55-59). 

11. FRCMC# should be tied to Voc (high) to ensure proper operation of the Pentium processor as a master Pentium 
processor. 

12. Setup time is required to guarantee recognition on a specific clock. 

13. Hold time is required to guarantee recognition on a specific clock. 

14. All TTL timings are referenced from 1.5 V. 

15. To guarantee proper asynchronous recognition, the signal must have been deasserted (inactive) for a minimum of 2 
clocks before being returned active and must meet the minimum pulse width. 

16. This input may be driven asynchronously. 

17. When driven asynchronously, NMI, FLUSH#, R/S#, INIT, and SMI# must be deasserted (inactive) for a minimum of 2 
clocks before being returned active. 

18. Functionality is guaranteed by design/characterization. 

19. Measured on rising edge of adjacent CLKs at 1.5V. 

20. To ensure a 1:1 relationship between the amplitude of the input jitter and the internal and external clocks, the jitter 
frequency spectrum should not have any power spectrum peaking between 500 KHz and 1% of the CLK operating frequency. 
21. The amount of jitter present must be accounted for as a component of CLK skew between devices. 
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Each valid delay is specified for a 0 pF load. The system designer should use I/O buffer modeling to account 
for signal flight time delays. 


Tv = t5, t49 

Tw = t4, t48 

Tx = t3, t47 : 241595-5 
Ty = t1, t45 

Tz = t2, t46 


Figure 3-1. Clock Waveform 


1.5V VALID 


241595-6 
Tx = 6, t6a, 18, t9, t10, t11, tita, t12 


Figure 3-2. Valid Delay Timings 


Tx = t7, t13 241595-7 
Ty = t6min, t12min 


Figure 3-3. Float Delay Timings | 
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Tx = t14, t16, t18, t18a, t20, t22, t24, t26, t28, t31, 34, t84a 
Ty = t15, t17, t19, t21, t23, t25, t27, t29, t32, 135 
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Figure 3-4. Setup and Hold Timings 
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Output 
Signals 


Signal 
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Figure 3-6. Test Timings 
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Figure 3-7. Test Reset Timings 


4.0 MECHANICAL SPECIFICATIONS Figure 4-1 shows the package dimensions for the 


Pentium processor. The mechanical specifications 
The Pentium processor is packaged in a 273 pin ce- are provided in Table 4-2. 
ramic pin grid array (PGA). The pins are arranged in 
a 21 by 21 matrix and the package dimensions are 
2.16” X 2.16” (Table 4-1). 


Table 4-1. Pentium™ Processor Package Information Summary 
Package | Total : Estimated 
Pentium PGA 273 21 X 21 2.16" x 216" 16 
Processor 5.49 cm xX 5.49 cm 


NOTE: 
See D.C. Specifications for more detailed power specifications. 


PIN D4 
(signal 015 


Oo00O0oOfpoocoo0coo0co0o0co0ao0o0c0o0o00g0d 0 
o0 Oo @ooodoooodcao0oo0ao0ao0ac0od 8 

o0o0o0o0o0o0oco0o0g00o00g0g00g0g 0 0 
o0o0o0eo0o0o0o0o0o 0ce 0cO0 0000 000 DO OH 
o0o0o0o0 0c00 0000000000000 
o0o0o0co0o0o0o0ao0o0o0o0o0q0000gn0e0 0c 00 0 


2-29 REF. 


hoe X BS ENER) 


241595-12 


Figure 4-1. Pentium™ Processor Package Dimensions 
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Figure 4-2. Pentium™ Processor Package Dimensions 


Table 4-2. Pentium™ Processor Mechanical Specifications 


Family: Ceramic Pin Grid Array Package 


Millimeters 


0.185 
0.017 
0117 
0.048 
0.020 
2.168 


Solid Lid 
Solid Lid 


Heal | 
| 
ee eres 


8.382 0.330 


1.510 
1.612 


Spreader Size 


Braze 


Total Pins 


- 0.127 Flatness of Flatness of 
spreader spreader 
measured measured 

diagonally diagonally 
16 
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5.0 THERMAL SPECIFICATIONS 


The Pentium processor is specified for proper opera- 
tion when Tc (case temperature) is within the speci- 
fied range. To verify that the proper Tc is main- 
tained, it should be measured at the center of the 
top surface (opposite of the pins) of the device in 
question. To minimize the measurement errors, it is 
recommended to use the following approach: 


e Use 36 gauge or finer diameter k, t, or j type ther- 
mocouples. The laboratory testing was done us- 
ing a thermocouple made by Omega (part num- 
ber: 5TC-TTK-36-36). 


PENTIUM™ PROCESSOR 


e Attach the thermocouple bead or junction to the 
center of the package top surface using high 
thermal conductivity cements. The laboratory 
testing was done by using Omega Bond (part 
number: OB-100). 


e The thermocouple should be attached at a 90 de- 
grees angle as shown in Figure 5-1. When a heat 
sink is attached, a hole (no larger than 0.15”) 
should be drilled through the heat sink to allow 
probing the center of the package as shown in 
Figure 5-1. 

e |f the case temperature is measured with a heat 
sink attached to the package, drill a hole through 
the heat sink to route the thermocouple wire out. 


241595-13 


Figure 5-1. Technique for Measuring Tcase 


An ambient temperature T, is not specified directly. 
The only restriction is that Tc is met. To determine 
the allowable Ta values, the following equations 
may be used: 


Ty = TC + (P * Ojo) 
Ta = Ty — (P* Oya) 
Goa = Cyn — Bjc 


Th = Te Bea) 


where, Ty, Ta, and Tc = Junction, Ambient and 
Case Temperature, respectively. 


Ojc, Oya, and Oca =  Junction-to-Case, 
Junction-to-Ambient, and Case-to-Ambient 
Thermal Resistance, respectively. 


P = Maximum Power Consumption 


Table 5-1 lists the 6jc and Oca values for the Penti- 
um processor. 


Table 5-1. Junction-to-Case and Case-to-Ambient Thermal Resistances for the Pentium™ Processor | 
(With and Without a Heat Sink) 


NOTE: 


9ca VS Airflow (ft/min) 


1. Heat Sink: 2.1 sq. in. base, omni-directional pin Al heat sink with 0.050 in. pin width, 0.143 in pin-to-pin center spacing and 
0.150 in. base thickness. Heat sinks are attached to the package with a 2 to 4 mil thick layer of typical thermal grease. The 
thermal conductivity of this grease is about 1.2 w/m °C. 

2. 8ca values shown in Table 5-1. are typical values.The actual 9cq values depend on the air flow in the system (which is 
typically unsteady, non uniform and turbulent) and thermal interactions between Pentium™ processor and surrounding com- 
ponents though PCB and the ambient. 
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OverDrive™ PROCESSOR FOR PENTIUM™ PROCESSOR- 
BASED SYSTEMS SOCKET SPECIFICATION 


m Powerful Performance Boosters for m OverDrive Processors Upgrade 
Pentium™ Processor-Based Systems Systems Based on 

— Improve Overall System — 60 MHz Pentium Processors 

Performance by an average of 50% — 66 MHz Pentium Processors 


— Increase Both Integer and Floating- 


Point Performance = Binary Compatible with Large Installed 


Software Base 


m@ Superscalar Architecture , — MS-DOS*, OS/2*, Windows* 
— Two Pipelined Integer Units — UNIX* System V/386 
— Capable of under One Clock per — iRMX®, iRMK Kernels 


Instruction 
— Pipelined Floating-Point Unit 


1.0 FUNCTIONAL DESCRIPTION 


The OverDrive™ processor for Pentium™ processor-based systems(1) will offer significant increases in over- 
all system performance. Designed as an in-socket upgrade to the Pentium processor, the OverDrive processor 
will accelerate the performance of everyday DOS*, OS/2*, Windows* and UNIX* applications. 


NOTE: 
1. Subsequent references to the ‘“‘OverDrive processor for Pentium processor-based systems’”’ will be abbre- 
viated to ‘OverDrive processor’ in this document. The OverDrive processor for Pentium processor-based 
systems will be referred to as the “Higher performance Pentium OverDrive processor” in end-user advertis- 
ing during the 1993-1994 timeframe. 


The OverDrive processor will be socket compatible with the Pentium processor. The OverDrive processor for 
Pentium processor-based systems has the same pinout and A.C. timing specifications as the Pentium proces- 
sor. The OverDrive processor is packaged in a 273-pin, ceramic, pin grid array package with an attached fan/ 
heatsink present on the OverDrive processor chip. The active cooling solution will be composed of heat sink 
with attached fan. See Sections 5 and 6 in this document for OverDrive processor mechanical and thermal 
information. OEMs should be sure to address the OverDrive processor’s additional vertical height and required 
horizontal clearance due to the active cooling solution. 


Performance monitoring, a feature which allows trace instruction execution in order to optimize software code, 
will not be implemented the same on the Pentium processor and the OverDrive processor for Pentium proces- 
sor based systems. 


With the exception of the OverDrive processor’s mechanical, thermal and performance monitoring differences 


from the Pentium processor, the OverDrive processor’s functional characteristics will be compatible with the 
Pentium processor. 


*Other brands and names are the property of their respective owners. 
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2.0 OverDrive™ PROCESSOR 
UPGRADABILITY 


2.1 OverDrive™ Processor 
Upgradability Requirements 


A Pentium processor based system should be de- 
signed to meet certain requirements to support up- 
gradability. 


The system must feature a 273-pin, Zero Insertion 
Force (“ZIF’’) socket. 


The system, as originally shipped, must carry an 
Intel Pentium processor in the OverDrive socket. 


The OverDrive socket should be in a visible and eas- 
ily accessible location to facilitate end user removal 
of the Pentium processor and OverDrive processor 
installation in the same socket. Unacceptable loca- 
tions or conditions would include placement beneath 
daughter cards, or which require removal of disk 
drives or power supplies. Removal of bus cards to 
permit end user access to the OverDrive socket 
would be acceptable. 


The system should allow clearance of 1.2” above 
the processor for the OverDrive processor’s inte- 


grated fan/heatsink. This clearance is divided into 
the size of the fan/heatsink and the free space 


AMP 


aaa 69.00£0.5mm 


e0o0o00c9ccCC0CC0C0C0C0C0CC0000N0 
ecoco0000C0C0C0C0CCCCC0C0CN00N0 
eo0o0o0000cCcC0CCCCCCCC0CC0CCN0 
ecoo0o00c0cC0C0CC0CCCCCOC0C000 


57.3540.4mm 


64.00 £0.5mm 


e0000cC0CCCNCCOCCKCOCCCOOCOCOoO}. 
ecococ00c0C0C0CC CCC CCCC00N0 
ceoocoo00o0c0CCCCC0C0C0C00000 
PSC000CCOCNCCNCNCCCCCC0C00N0 
' 
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above the fan/heatsink needed to ensure proper air 
flow. 


The thermal requirements are described in Section 6 
of this document. 


The system should allow full movement of the 
OverDrive socket handle. For example, any heat 
sink attached to the Pentium processor should not 


- overhang the processor in any way that would im- 


pede full movement of the OverDrive socket handle. 


The system must not require any hardware or soft- 
ware modifications to operate the OverDrive proces- 
sor, including, but not limited to, jumper or switch 
setting changes, and/or BIOS or logic changes; e.g., 
jumper changes for frequency selection are accept- 
able if they are optional and not required for | 
OverDrive processor operation. 


OverDrive processor installation in the OEM system 
must not affect the system warranty. 


OEM should provide end user documentation with 
the OEM system describing the OverDrive processor 
installation process. 


2.2 OverDrive™ Socket 
The following drawings in Figure 2-1 show the pre- 
liminary worst case OverDrive socket footprints from 


Yamaichi 
ere 68.9710-2 mm 


occo0000C0C0CcC0CCCCCCO000N0 
ecoc0c000cc0000C0C0000000000 
eoc0c0000C0C0CCCCCCOC000N0 
oo0o00c0c0C0C0COCCOCOCCOC0C000 


67.5440.5mm 


59.90+0.4mm 
CO000DD00D000000000000 
e00D0D0DDDDDNNONONONONNNNN 
CO0000D000D0N0N00N000N0N0000 
C0O00DDDD000D0DN0NN0NNNNNN 
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Figure 2-1. OverDrive™ Socket Footprint Dimensions 
(See socket manufacturer for the most current information.) 
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2 potential OverDrive socket vendors, AMP and Ya- Figure 2-2 shows the OverDrive processor chip’s ori- 
maichi. OEMs should work directly with socket ven- entation in the OverDrive socket. 
dors for the most current socket information. 
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3.0 PIN DESCRIPTIONS description, see the Hardware Interface chapter in 

the Pentium Processor Data Book. Note that all in- 
The OverDrive processor pinout as well as the func- put pins must meet their A.C./D.C. specifications to 
tionality of the OverDrive processor’s pins are identi- guarantee proper functional behavior. Figures 3-1 


cal to that of the Pentium processor. For a detailed and 3-2 show the OverDrive processor pinout. 
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Figure 3-1. OverDrive™ Processor Pinout (Top View) 
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Figure 3-2. OverDrive™ Processor Pinout (Bottom View) 


Locations E17, S17 and S5 should be plugged on _ Locations N4 and L3 are ADSC# and BRDYC# re- 
the Pentium processor/OverDrive processor socket spectively if the 82496 cache controller and 82491 
in order to ensure that the Pentium processor/Over- cache SRAMs are used. 

Drive processor chip is installed in the socket with 

the correction orientation. 
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4.0 ELECTRICAL SPECIFICATIONS 


The OverDrive processor will have the same power 
and ground specifications, decoupling recommenda- 
tions, connection specifications and maximum rat- 
ings as the Pentium processor. 


intel. 


The OverDrive processor will have the same D.C. 
specifications as the Pentium processor, except that 
loc (Power Supply Current) is the following: 


4.1 D.C. Specifications 


Table 4-2. OverDrive Processor Icc Specification Voc = 5V +5%, Tsinx = 0°C to + 70°C 


Power Supply Current 


NOTE: 


66 MHz(1) : 
60 MHz(1) 


2700 
2500 


mA 
mA 


1. Worst case average Icc for a mix of test patterns. (The mix of test patterns will be determined after silicon is examined.) 


See The Pentium Processor Data Book for a listing 
of the remaining D.C. specifications. 


4.2 A.C. Specifications 


The OverDrive processor will have the same A.C. 
specifications as the Pentium processor. The func- 
tional parameters for the OverDrive processor’s A.C. 
specifications are the following: 


Voc = 5V +5% 
TsiInK = 0°C to + 70°C 
CL = 0 pF 


See the Pentium Processor Data Book for a listing of 
the A.C. specifications. 
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5.0 MECHANICAL SPECIFICATIONS 


The OverDrive processor for Pentium processor 
based systems is packaged in a 273-pin ceramic pin 
grid array (PGA) with attached fan/heatsink. The 
pins are arranged in a 21 x 21 matrix and the pack- 
age dimensions will be 2.16” x 2.16” (5.49 cm x 
5.49 cm). 


Table 5-1. OverDrive™ Processor Package 
Information Summary 
Max 


cy ae Package Size 
yP Wattage 
PGA 273. (21x21) 2.46" x2.16" 13.5 
_ |5.49cmx 5.49 cm 


NOTE: 
See D.C. Specifications for more detailed power specifica- 
tions. 


Estimated 


PRELIMINARY 


‘OverDrive™ PROCESSOR 


Table 5-2. OverDrive™ Processor Mechanical Specifications 


Solid Lid* 
Solid Lid 


0.112 0.138 


Solid Lid 0.013 0.017 


55.11 
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See Figure 5-3 for required free space from sides of the PGA package. 


Figure 5-1. OverDrive™ Processor Package Dimensions 
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As can be seen in the mechanical dimensions in Ta- 
ble 5-2 and Figure 5-1, the actual height required by 
the heat sink and fan is less than the total space 
allotted. Since the OverDrive processor for Pentium 
processor based systems employs a fan/heatsink, a 
certain amount of space is required above the fan/ 
heatsink unit to ensure that the airflow is not 
blocked. Figure 5-2 shows unacceptable blocking of 
the airflow for the OverDrive processor fan/heat- 
sink. Figure 5-3 details the minimum space needed 
around the PGA package to ensure proper heat sink 
airflow. 


intel. 


As shown in Figure 5-3, it is acceptable to allow any 
device (i.e., add-in cards, surface mount device, 
chassis, etc.) to enter within the free space distance 
of 0.2” from the PGA package if it is not taller than 
the level of the heat sink base. In other words, if a 
component is taller than height “B’, it cannot be 
closer to the PGA package than distance ‘‘A’’. This 
applies to all four sides of the PGA package, al- 
though the back and handle sides of a ZIF socket 
will generally automatically meet this specification 
since they have widths larger than distance “‘A”’. 


Minimum Air 


Obstruction 


RNAI... 


Not Acceptable 
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Figure 5-2. Active Heat Sink Top Space Requirements 
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Figure 5-3. Required Free Space From Sides of PGA Package 
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6.0 THERMAL SPECIFICATIONS 


The fan/heatsink cooling solution will properly cool 
the OverDrive processor as long as the maximum air 
temperature entering the fan/heatsink cooling solu- 
tion (Ta(in)) does not exceed 45°C. It is left up to the 
OEM to ensure that Ta(in) meets this specification 
by providing sufficient airflow around the OverDrive 
processor heat sink unit. 


Intel’s fan/heatsink will dissipate approximately 
0.5W and is powered by the chip such that no exter- 
nal wires or connections are required. The extra 
power needed for the fan is taken into account in the 
Icc numbers of the processor. Additionally, Intel is 
evaluating the feasibility of having the OverDrive 
processor monitor its temperature. No BIOS or hard- 
ware changes will be needed for this thermal protec- 
tion mechanism. The shut down temperature will be 
greater than the maximum temperature specification 
of the processor. The fan unit will be designed to be 
removable so that if fan failure should occur, the unit 
may be easily replaced. Figure 6-1 gives a functional 
representation of the processor and heat sink unit. 
The actual heat sink unit may be different from the 
one shown in the figure. # 
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Since the OverDrive processor for Pentium proces- 
sor based systems employs a fan/heatsink, it is not 
as important that the processor heat sink receive 
direct airflow, rather that the system has sufficient 
capability to remove the warm air that the OverDrive 


_processor will generate. This implies that enough air- 


flow exists at the OverDrive processor socket site to 
keep localized heating from occurring. This can be 
accomplished by a standard power supply fan with a 
clear path to the processor. It is recommended that 
the power supply use a fan that can supply a mini- 
mum of 35 CFM of airflow at the fan exit. This will 
help ensure that the air exchange rate of the system 
will be sufficient to meet the OverDrive processor 
thermal requirements. Figure 6-2 shows how system 
design can cause localized heating to occur by limit- 
ing the airflow in the area of the processor. The air- 
flow supplied in the system should also be enough 
to ensure that the OEM processor shipped with the 
system will meet the OEM processor thermal specifi- 
cations before the system is upgraded with the 
OverDrive processor. 


Free Space 


Fan 


Heat Sink 


RSS OOOO OOOO MOQ OQOO OOo OOo 


Ceramic PGA 
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Figure 6-1. Active Heat Sink Example 
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Figure 6-2. OverDrive™ Processor Airflow Design Examples 
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7.0 REVISION HISTORY 


Revision -003 of the OverDrive™ Processor for 
Pentium™ Processor-Based Systems Socket Speci- 
fication contains updates and improvements to the 
previous version. A revision summary of major 
changes is listed below: 


Global: Some of the electrical specifications have 
been removed since they are already shown 
in the Pentium Processor Data Book. These 
sections now contain references to the Pen- 
tium Processor Data Book. 
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Table 5-2: 


Figure 5-1: 
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Table 5-2 contains the following addition 
errors: 


Symbol A1 should have a max tolerance 
of 3.50mm. 


Symbol A should have a max tolerance 
of 33.98mm. 


Symbol A should have a max tolerance 
of 1.338” . 


Symbol “A” has been corrected to rep- 


‘resent the dimension from the package 


lid to the minimum required free air- 
space. 
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® 
For more information regarding the Pentium™ Proc- Also listed in this chapter are brief technical profiles of 
essor, you may obtain the Pentium Processor User’s other Pentium Processor peripheral products, including 


Manual by calling our toll-free literature distribution the 82489DX Advanced Interrupt Controller and the 
center at 1-800-548-4725. The Pentium Processor User’s 82430 PClset. For more information on these products, 
Manual is a three-volume set with details on Pentium please consult the 1994 Peripherals Handbook. 
Processor hardware and software design parameters, 

with specifications also for the 82496 Advanced Cache 

Controller and 82491 Cache SRAM. 
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82496 CACHE CONTROLLER AND 
82491 CACHE SRAM 
FOR USE WITH THE Pentium™ PROCESSOR 


= High Performance Second Level Cache @ Highly Flexible 


— Zero Wait States at 66 MHz — 256K to 512K with parity 
— Two-way Set Associative — 32, 64, or 128-Bit Wide Memory Bus 
— Write-Back with MESI Protocol — Synchronous, Asynchronous, and 
— Concurrent CPU Bus and Memory Strobed Memory Bus Operation 
Bus Operation — Selectable Bus Widths, Line Sizes, 
— Boundary Scan Transfers, and Burst Orders 
@ Pentium Processor m Full Multiprocessing Support 
— Chip Set Version of Pentium — Concurrent CPU, Memory Bus, and 
Processor Snoop Operations 
— Superscalar Architecture — Complete MES! Protocol 
— Enhanced Floating Point — Internal/External Parity Generation/ 
— On-chip 8K Code and 8K Data Checking 
Caches — Supports Read-for Ownership, Write- 
— See Pentium™ Processor User’s Allocation, and Cache-to-Cache 
Manual Volume 2 for more Transfers 
information 


The 82496 Cache Controller and multiple 82491 Cache SRAMs combine with the Pentium processor to form a 
CPU Cache chip set designed for high performance servers and function-rich desktops. The high speed 
interconnect between the CPU and cache components has been optimized to provide zero-wait state opera- 
tion. This CPU Cache chip set is fully compatible with existing software, and has new data integrity features for 
mission critical applications. 

The 82496 cache controller implements the MESI write-back protocol for full multiprocessing support. Dual 
ported buffers and registers allow the 82496 to concurrently handle CPU bus, memory bus, and internal cache 
operation for maximum performance. 

The 82491 is a customized high-performance SRAM that supports 32, 64, and 128-bit wide memory bus 
widths, 16, 32, and 64 byte line sizes, and optional sectoring. The data path between the CPU bus and 
memory bus is separated by the 82491, allowing the CPU bus to handshake synchronously, asynchronously, 
or with a strobed protocol, and allowing concurrent CPU bus and memory bus operations. 
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82430 PCiset 
FOR THE Pentium™ PROCESSOR 


= Supports the Pentium™ Processor at 
60 MHz or 66 MHz 


@ Interfaces the Host and Standard 
Buses to the Peripheral Component 
interconnect (PCi) Local Bus Operating 
at 30 MHz or 33 MHz 
— Up to 132 Mbytes/sec Transfer Rate 
— Full Concurrency between CPU Host 

Bus and PCI Bus Transactions 


m@ Integrated Cache Controller Provided 
for Optional Second Level Cache 
— 256 Kbyte or 512 Kbyte Cache 
— Write-Back or Write-Through Policy 
— Standard or Burst SRAM 


m Integrated Tag RAM for Cost Savings 
on Second Level Cache 


@ Provides a 64-Bit Interface to DRAM 
Memory 
— From 2 Mbytes to 192 Mbytes of 
Main Memory 
— 70 ns and 60 ns DRAMs Supported 


m Supports the Pipelined Address Mode 
of the Pentium Processor for Higher 
Performance 


@ Optional ISA or EISA Standard Bus 


interface 

— Single Component ISA Controller 

— Two Component EISA Bus Interface 
— Minimal External Logic Required 


Supports Burst Read and Writes of 
Memory from the Host and PCI Buses 


Five Integrated Write Posting and Read 
Prefetch Buffers Increase CPU and PCI 
Master Performance 


Host CPU Writes to PCI in Zero Wait 
State PCl Bursts with Optional TRDY # 
Connection 


Integrated Low Skew Host Bus Clock 
Driver for Cost and Board Space 
Savings 


PCiset Operates Synchronous to the 
66 MHz CPU and 33 MHz PCI Clocks 


Byte Parity Support for the Host/PCli 

and Main Memory Buses 

— Optional Parity on the Second Level 
Cache 


The 82430 PClset provides the Host/PClI bridge, cache/main memory controller, and an |/O subsystem core 
(either PCI/EISA or PCI/ISA bridge) for the next generation of high-performance personal computers based 
on the Pentium Processor. System designers can take advantage of the power of the PCI (Peripheral Compo- 
nent Interconnect) bus for the local |/O while maintaining access to the large base of EISA and ISA expansion 
cards, and corresponding software applications. Extensive buffering and buffer management within the bridg- 
es ensures maximum efficiency in all three bus environments (Host CPU, PCI, and EISA/ISA Buses). 


The 82430 PCliset consists of the 82434LX PCI/Cache/Memory Controller (PCMC) and the 82433LX Local 
Bus Accelerator (LBX) components, plus, either a PCI/ISA bridge or a PCI/EISA bridge. The PCMC and LBX 
provide the core cache and main memory architecture and serve as the Host/PCI bridge. For an |ISA-based 
system, the 82430 PClset includes the 82378 System I/O (SIO) component as the PCI/ISA bridge. For an 
EISA-based system, the 82430 PClset includes the 82375EB PCI/EISA Bridge (PCEB) and the 82374EB EISA 
System Component (ESC). The PCEB and ESC work in tandem to form the complete PCI/EISA bridge. Both 
the ISA and EISA-based systems are shown on the following pages. 


For complete data sheets on all these devices, refer to Order Number 290482 and 290483. 
Pentium is a trademark of Intel Corporation. 
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82430 PCiset ISA Block Diagram 


Pentium™ Processor 
60 or 66 MHz 


Host Bus 


ISA Bus 


290481-1 


2-46 


intel ® | : 82430 PCiset 


82430 PCiset EISA Block Diagram 


Pentium™ Processor 
60 or 66 MHz 


Host Bus 
Control 


_ Address 
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82434LX 


PCI/CACHE/MEMORY CONTROLLER (PCMC) 

m@ Supports the 64-Bit Pentium™ m Integrated DRAM Controller 

Processor at 60 MHz and 66 MHz — Supports 2 MBytes to 192 MBytes of 
nali , Cacheable Main Memory 

= Supports Pipelined Addressing ; 
Capability of the Pentium — Supports DRAM Access Times of 
Microprocessor 70 ns and 60 ns 

— CPU Writes Posted to DRAM at 

m@ High Performance CPU/PCI/Memory 4-1-1-1 
interfaces via Posted-Write/Read- — Refresh Cycles Decoupled from ISA 
Prefetch Buffers Refresh to Reduce the DRAM 

m Fully Synchronous 33 MHz PCI Bus Access Latency 
Interface with Full Bus Master — Refresh by RAS#-Only, or CAS#- 
Capability before-RAS #, in Single or Burst of 


, Four 
m= Supports the Pentium Processor 
Primary Cache in either Write-Through m Host/PCl Bridge 


or Write-Back Mode — Translates CPU Cycles into PCI Bus 
; Cycles 
. i ti SL aeE Map of DOS — Translates Back-to-Back Sequential 
na rare egions for system CPU Memory Writes into PCI Burst 
Seam Cycles 
= Integrated Low Skew Clock Driver for — Burst Mode Writes to PCI in Zero PCI 
Distributing 66 MHz Clock Wait States (i.e., Data Transfer Every 
Cycle) 
2 rp ita a Sechaba — Full Concurrency between CPU-to- 
— Integrated Cache Tag RAM cid rishi and PCI-to-PCI 
— Write-Through and Write-Back Cache alpaca nisl 
Modes — Full Concurrency between CPU-to- 
— Direct-Mapped Organization Second Level Cache and PCI-to-Main 
— Supports Standard and Burst SRAMs etl hy eciapey a 
— 256 KByte and 512 KByte Sizes ee eee Se roe 
— Cache Hit Cycle of 3-1-1-1 on Reads System Logic Design for ISA or EISA 
and Writes Using Burst SRAMs Systems 
— Cache Hit Cycle of 3-2-2-2 on Reads — Cache Snoop Filter Ensures Data 
and 4-2-2-2 on Writes Using Consistency for PCl-to-Main Memory 
Standard SRAMs Transactions 
m PCMC (208-Pin QFP Package) Uses 5V 
CMOS Technology 


The 82434LX PCI, Cache, Memory Controller (PCMC) integrates the cache and main memory DRAM control 
functions and provides the bus control for transfers between the CPU, cache, main memory, and the Peripher- 
al Component Interconnect (PCI) Local Bus. The cache controller supports both write-through and write-back 
cache policies and cache sizes of 256 KBytes and 512 KBytes. The cache memory can be implemented with 
either standard or burst SRAMs. The PCMC cache controller integrates a high-performance Tag RAM to 
reduce system cost. Up to twelve single-sided SIMMs or six double-sided SIMMs provide a maximum of 192 
MBytes of main memory. The PCMC is intended to be used with the 82433LX Local Bus Accelerator (LBX). 
The LBX provides the Host-to-PCl address path and data paths between the CPU/cache, main memory, and 
PCl. The LBX also contains posted write buffers and read-prefetch buffers. Together, these two components 
provide a full function data path to main memory and form a PCI bridge to the CPU/Cache and DRAM 
subsystem. 
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82433LX 
LOCAL BUS ACCELERATOR (LBX) 


Supports the Full 64-Bit Pentium™ 
Processor Data Bus at 66 MHz 


Provides a 64-Bit Interface to DRAM 
and a 32-Bit Interface to PCI 


Five Integrated Write Posting and Read 

Prefetch Buffers Increase CPU and PCI 

Master Performance 

— CPU-to-Memory Posted Write Buffer 
4 Qwords Deep 

— PCli-to-Memory Posted Write Buffer 
Two Buffers, 4 Dwords Each 

— PCi-to-Memory Read Prefetch Buffer 
4 Qwords Deep 

— CPU-to-PCl Posted Write Buffer 
4 Dwords Deep 

— CPU-to-PCl Read Prefetch Buffer 
4 Dwords Deep 


Host-to-Memory and Host-to-PCI Write 
Posting Buffers Accelerate Write 
Performance 


Dual-Port Architecture Allows 
Concurrent Operations on the Host and 
PCi Buses 


Operates Synchronous to the 66 MHz 
CPU and 33 MHz PCI Clocks 


Supports Burst Read and Writes of 
Memory from the Host and PCI Buses 


Sequential CPU Writes to PCI 
Converted to Zero Wait State PCI 
Bursts with Optional TRDY # 
Connection 


Byte Parity Support for the Host and 

Memory Buses 

— Optional Parity Generation for Host 
to Memory Transfers 

— Optional Parity Checking for the 
Secondary Cache Residing on the 
Host Data Bus 

— Parity Checking for Host and PCI 
Memory Reads 

— Parity Generation for PCI to Memory 
Writes 


160-Pin QFP Package 
5V CMOS Technology 


Two 82433LX Local Bus Accelerator (LBX) components provide a 64-bit data path between the Host CPU/ 
cache and main memory, a 32-bit data path between the Host CPU bus and the PCI Local Bus, and a 32-bit 
data path between the PCI local bus and main memory. The dual-port architecture allows concurrent opera- 
tions on the Host and PCI Buses. The LBXs incorporate three write posting buffers and two read prefetch 
buffers to increase Pentium processor and PCI Master performance. The LBX supports byte parity for the Host 
and main memory buses. The LBX is intended to be used with the 82434LX PCI/Cache/Memory Controller 
(PCMC). During bus operations between the Host, main memory, and PCI, the PCMC commands the LBXs to 
perform functions such as latching address and data, merging data, and enabling output buffers. Together, 
these three components form a ‘‘Host Bridge’ which provides a full function dual-port data path interface, 
linking the Host CPU and PCI bus to main memory. 
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82375EB 
PCI/EISA BRIDGE (PCEB) 


Provides the Bridge between the PCI m@ 32-Bit Data Paths 
Bus and EISA Bus m Integrated EISA Data Swap Buffers 
100% PCI and EISA Compatible —" PCI Devices 
—PCI and EISA Master/Slave Interface" ATR UTG Tet ee eer s 
— Directly Drives 10 PCI Loads and 8 — Fixed, Rotating, or a Combination of 
EISA Slots the Two 
— Supports PCI at 25 MHz to 33.33 MHz 
m PCI and EISA Address Decoding and 
Data Buffers Improve Performance Mapping 
— Four 32-Bit PCi-to-EISA Posted Write — Positive Decode of Main Memory 
Buffers # ation 
— Four 16-Byte EISA-to-PCI Read/Write Bal spay se hi sig <I 
_ a lee Prefetch apne 
— EISA-to-PCI and PCI-to-EISA Write iat aa pense pape 
Posting ; 
m@ Programmable Main Memory Address 


Data Buffer Management Ensures Data 

Coherency 

— Flush Posted Write Buffers 

— Flush or invalidate Line Buffers 

— Instruct All PCI Devices to Flush 
Buffers Pointing to PCI Bus before 


Decoding 

— Main Memory Sizes up to 
512 MBytes 

— Access Attributes for 15 Memory 
Segments in First 1 MByte of Main 
Memory 


Granting EISA Access to PCI 


@ Burst Transfers on both the PCI and 
EISA Buses 


— Programmable Main Memory Hole 
m Integrated 16-Bit BIOS Timer 
m 208-Pin QFP Package 
m 5V CMOS Technology 


The 82375EB PCI-EISA Bridge (PCEB) provides the master/slave functions on both the Peripheral Compo- 
nent Interconnect (PCl) Local Bus and the EISA Bus. Functioning as a bridge between the PCI and EISA 
buses, the PCEB provides the address and data paths, bus controls, and bus protocol translation for PCl-to- 
EISA and EISA-to-PCl transfers. Extensive data buffering in both directions increases system performance by 
maximizing PCI and EISA Bus efficiency and allowing concurrency on the two buses. The PCEB’s buffer 
management mechanism ensures data coherency. The PCEB integrates central bus control functions includ- 
ing a programmable bus arbiter for the PCI Bus and EISA data swap logic for the EISA Bus. Integrated system 
functions include PCI parity generation, system error reporting, and programmable PCI and EISA memory and 
I/O address space mapping and decoding. The PCEB also contains a BIOS Timer that can be used to 
implement timing loops. The PCEB is intended to be used with the EISA System Component (ESC) to provide 
an EISA I/O subsystem interface. 
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82374EB 
EISA SYSTEM CONTROLLER (ESC) 


integrates EISA Compatible Bus 

Controller 

— Translates Cycles between EISA and 
ISA Bus 

— Supports EISA Burst and Standard 
Cycles 

— Supports ISA No Wait State Cycles 

— Supports Byte Assembly/ 
Disassembly for 8-Bit, 16-Bit and 
32-Bit Transfers 

— Supports Bus Frequency up to 
8.33 MHz 


Supports Eight EISA Slots 

— Directly Drives Address, Data and 
Control Signals for Eight Slots 

— Decodes Address for Eight Slot 
Specific AENs 


Provides Enhanced DMA Controller 

— Provides Scatter-Gather Function 

— Supports Type A, Type B, Type C 
(Burst), and Compatible DMA 
Transfers 

— Provides Seven Independently 
Programmable Channels 

— Integrates Two 82C37A Compatible 
DMA Controllers 


Provides High Performance Arbitration 

— Supports Eight EISA Masters and 
PCEB 

— Supports ISA Masters, DMA 
Channels, and Refresh 

— Provides Programmable Arbitration 
Scheme for Fixed, Rotating, or 
Combination Priority 


Integrates Support Logic for X-Bus 

Peripherals and More 

— Generates Chip Selects/Encoded 
Chip Selects for Floppy and 
Keyboard Controller, IDE, 
Parallel/Serial Ports, and General 
Purpose Peripherals 

— Provides Interface for Real Time 
Clock 

— Generates Control Signals for X-Bus 
Data Transceiver 

— Integrates Port 92, Mouse Interrupt, 
and Coprocessor Error Reporting 


Integrates the Functionality of Two 

82C59 Interrupt Controllers and Two 

82C54 Timers 

— Provides 14 Programmable Channels 
for Edge or Level Interrupts 

— Provides 4 PCI Interrupts Routable 
to Any of 11 Interrupt Channels 

— Supports Timer Function for Refresh 
Request, System Timer, Speaker 
Tone, Fail Safe Timer, and Periodic 
CPU Speed Control | 


Generates Non-Maskable Interrupts 
(NMI) 

— PCI System Errors 

— PCI Parity Errors 

— EISA Bus Parity Errors 

— Fail Safe Timer 

— Bus Timeout 

— Via Software Control 


Provides BIOS Interface 

— Supports 512 KBytes of Flash or 
EPROM BIOS on the X-Bus 

— Allows BIOS on PCI 

— Supports Integrated VGA BIOS 


208-Pin QFP Package 
5V CMOS Technology 


The 82374EB EISA System Component (ESC) provides all the EISA system compatible functions. The ESC, 
with the PCEB, provides all the functions to implement an EISA to PCI bridge and EISA I/O subsystem. The 
ESC integrates the common I/O functions found in today’s EISA based PC systems. The ESC incorporates the 
logic for an EISA (master and slave) interface, EISA Bus Controller, enhanced seven channel DMA controller 
with Scatter-Gather support, EISA arbitration, 14 channel interrupt controller, five programmable timer/coun- 
ters, and non-maskable interrupt (NMI) control logic. The ESC also integrates support logic to decode periph- 
eral devices such as the Flash BIOS, Real Time Clock, Keyboard/Mouse Controller, Floppy Controller, two 
Serial Ports, one Parallel Port, and IDE Hard Disk Drive. 
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ESC Block Diagram 


PCICLK 
PERR# 
SERR# 

RESET# 


PCI — BCLKOUT 
Bus BCLK 
Interface LA[31:27]#/CPG[4:0] 
LA[26:24]# 
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PEREQ#/INTA# EISA . W/R# 
NMFLUSH# Bus EX32# 
ESC/ Interface EX16# 
SDCPYEN[13,3:1]# PCEB START# 
SDCPYUP Interface CMD# 
SDOE[2:0]# A EXRDY 
SDLE[3:0}# SLBURST# 
MSBURST# 
INTCHIPO MASTER16# 


SPKR <— SD[7:0] 
SLOWH# Timers 
_—- BALE 


IRQ(15:9,7:3, 1] 
IRQ8# 

INT 

NMI 


SALE#] 
LASAOE# 
SALAOE# 

FERR# 

IGNNE# 

LBIOSCS# 
KYBDCS# 
ALTRST# 

ALTA20 

ABFULL 

RTCALE 

RTCRD# 

RTCWR# 

FDCCS# 

DSKCHG 

DLIGHT# 
CRAMRD# 
CRAMWR# 
XBUSTR# 
XBUSOE# 
GPCS[2:0}#/ECS[2:0] 
AEN[4:1)/E AEN[4:1] 


Interrupt 
Controller 


Integrated 
Support 
Logic 


DMA 
Controller 
w S-G 


EISA 
Arbiter 


SA[1:0] 
SBHE# 
M16# 
1O16# 
MRDC# 
MWTC# 
SMRDC# 
SMWTC# 


REFRESH# 


> RSTDRV 


AEN# 
DREQ[7:5,3:0] 
DACK[7:5,3:0] 
EOP 


MREQ{7:4]#/PIRQ(0:3]# 


MREQ(3:0]# 


—® MACK[(3:0]}#/EMACK[3:0] 
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82378 
SYSTEM I/O (SIO) 
m Provides the Bridge between the PCI @ Utility Bus (X-Bus) Peripheral Support 
Bus and ISA Bus — Provides Chip Select Decode 
m 100% PCI and ISA Compatible — Controls Lower X-Bus Data Byte 
— PCI and ISA Master/Slave Interface Transceiver 
— Directly Drives 10 PCI Loads and 6 — Integrates Port 92, Mouse Interrupt, 
ISA Slots Coprocessor Error Reporting 
— Supports PCI at 25 MHz and = Integrates the Functionality of One 
33.33 MHz 82C54 Timer 
— Supports ISA from 6 MHz to — System Timer 
8.33 MHz — Refresh Request 
mw Enhanced DMA Functions — Speaker Tone Output 
— Scatter/Gather m Integrates the Functionality of Two 
— Fast DMA Type A, B, and F 82C59 Interrupt Controllers 
— Compatible DMA Transfers — 14 Interrupts Supported 
— 32-Bit Addressability . 
Big ee? Programmable ¥ pees tan Alan pepe 
— Functionality of Two 82C37A DMA ia Eee 
Controllers 208-Pin QFP Package 


m Integrated Data Buffers to Improve 5V CMOS Technology 


Performance 

— 8-Byte DMA/ISA Master Line Buffer 

— 32-Bit Posted Memory Write Buffer 
to ISA 


mg Integrated 16-Bit BIOS Timer 


@ Arbitration for PCI Devices 
— Two or Four External PCI Masters 
Are Supported 
— Fixed, Rotating, or a Combination of 
the Two 


w Arbitration for ISA Devices 
— ISA Masters 
— DMA and Refresh 


The 82378 System I/O (SIO) component provides the bridge between the PCI local bus and the ISA expansion 
bus. The SIO also integrates many of the common I/O functions found in today’s ISA based PC systems. The 
SIO incorporates the logic for a PCI interface (master and slave), ISA interface (master and slave), enhanced 
seven channel DMA controller that supports fast DMA transfers and Scatter/Gather, data buffers to isolate the 
PCI bus from the ISA bus and to enhance performance, PCI and ISA arbitration, 14 level interrupt controller, a 
16-bit BIOS timer, three programmable timer/counters, and non-maskable-interrupt (NMI) control logic. The 
SIO also provides decode for peripheral devices such as the Flash BIOS, Real Time Clock, Keyboard/Mouse 
Controller, Floppy Controller, two Serial Ports, one Parallel Port, and IDE Hard Disk Drive. 


IMPORTANT—READ THIS SECTION BEFORE READING THE REST OF THE DATA SHEET. 
This data sheet describes the 82378IB and 82378ZB components. All normal text describes the functional- 


ity for both components. All features that exist on the 82378ZB are shaded as shown below. 
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SD[15:0] 
SA[19:0] 
LA[23:17] 
lOCS16# 
MEMCS16# 
SBHE# 
MASTER# 
MEMR# 
MEMW# 
AEN 
IOCHRDY 
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SYSCLK 
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ADVANCE INFORMATION 
82489DX 


ADVANCED PROGRAMMABLE 
INTERRUPT CONTROLLER 


82489DX FEATURES OVERVIEW m Inter-Processor Interrupts 


@ Advanced interrupt Controller for m Various Addressing Schemes— 
32-Bit Operating Systems Broadcast, Fixed, Lowest Priority, etc. 


@ Solution for Multiprocessor Interrupt 
Management 


m@ Dynamic Interrupt Distribution for Load 
Balancing in MP Systems 


m@ Separate Nibble Bus (Interrupt 
Controller Communications (ICC) Bus) 


= Compatibility Mode with 8259A 

m@ 32-Bit Internal Registers 

m@ Integrated Timer Support 

m@ 33 MHz Operation 

m 132-Lead PQFP Package, Package 


for Interrupt Messages Type KU 


RESET 
CLKIN 
ADS 
BGT 
DLE 
M/I0 
D/C 
W/R 
cs 
RDY 
ExtINTA 


A[ 10:3] 


TCK 
TRST 
TDI 
TMS 
TDO 


(See Packaging Specification. Order Number: 240800) 


82489DX Block Diagram 
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32-BITF INTERFACE BATA 
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Compatible 
Boundary 
Scan TAP 
Controller 
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Refer to Application Note AP-388: 82489DX User’s Manual (Order Number 292116) when evaluating your 
design needs. 
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1.0 INTRODUCTION 


The 82489DX Advanced Programmable Interrupt 
Controller provides multiprocessor interrupt man- 
agement, providing both static and dynamic sym- 
metrical Interrupt distribution across all processors. 


The main function of the 82489DxX is to provide in- 
terrupt management across all processors. This dy- 
namic interrupt distribution includes routing of the in- 
terrupt to the Jlowest-priority processor. The 
82489DxX works in systems with multiple |/O subsys- 
tems, where each subsystem can have its own set 
of interrupts. This chip also provides inter-processor 
interrupts, allowing any processor to interrupt any 
processor or set of processors. Each 82489DX I/O 
unit Interrupt Input pin is individually programmable 
by software as either edge or level triggered. The 
interrupt vector and interrupt steering information 
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Task Priority Reg 


Internal DATA/ADDR 
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can be specified per pin. A 32-bit wide timer is pro- 
vided that can be programmed to interrupt the local 
processor. The timer can be used as a counter to 
provide a time base to software running on the proc- 
essor, or to generate time slice interrupts locally to 
that processor. The 82489DX provides 32-bit soft- 
ware access to its internal registers. Since no 
82489DX register reads have any side effects, the 
82489DxX registers can be aliased to a user read- 
only page for fast user access (e.g., performance 
monitoring timers). | 


The 82489DX supports a generalized naming/ad- 
dressing scheme that can be tailored by software to 
fit a variety of system architectures and usage mod- 
els. It also supports 8259A compatibility by becom- 
ing virtually transparent with regard to an externally 
connected 8259A style controller, making the 8259A 
visible to software. 


PNMI.PRST 
EXTINTA 


4-Bit Open 
Drain ICC 


Bus 
Interrupt 


Message 
Acceptance 


ICC BUS 


MESSAGE 
INTERFACE 


Interrupt Line 


Interrupt 


1/0 Select Reg. 
1/0 Unit ID Reg. 
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Figure 1. 82489DX Architecture 
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2.0 FUNCTIONAL OVERVIEW 


82489DX Functional Blocks 


82489DX contains one Local Unit, one I/O unit and 
a timer. The ICC bus is used to pass interrupt mes- 
sages. 


ICC BUS 


The ICC bus is a 5-wire synchronous bus connecting 
all 82489DxXs (all |/O Untis and all Local Units). The 
Local Units and |/O Units communicate over this 
ICC bus. Four of these five wires are used for data 
transmissions and arbitration, and one wire is a 
clock. 


LOCAL UNIT 


The Local Unit contains the necessary intelligence 
to determine whether or not its processor should ac- 
cept interrupt messages sent on the ICC bus by oth- 
er Local Units and |/O Units. The Local Unit also 
provides local pending of interrupts, nesting and 
masking of interrupts, and handles all interactions 
with its local processor such as the INT/INTA/EOI 
protocol. The Local Unit further provides inter-proc- 
essor interrupt functionality and a timer to its local 
processor. The interface of a processor to its 
82489DX Local Unit is identical for every processor. 


1/O UNIT 


The I/O Unit provides the interrupt input pins on 
which I/O devices inject interrupts into the system in 
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the form of an edge or a level. The I/O unit also 
contains a Redirection Table for the interrupt input 
pins. Each entry in the Redirection Table can be in- 
dividually programmed to indicate whether an inter- 
rupt on the pin is recognized as either an edge or a 
level; what vector and also what priority the interrupt 
has; and which of all possible processors should 
service the interrupt and how to select that proces- 
sor (statically or dynamically). The information in the 
table is used to send interrupt messages to all 
82489DX Units via the ICC bus. 


TIMER 


The 82489DxX provides a 32-bit wide timer that can 
be programmed to interrupt the local processor. The 
timer can be used as a counter to provide a time- 
base to software running on the processor, or to 
generate time-slice interrupts local to that proces- 
sor. 


3.0 PIN DESCRIPTION 


The 82489DxX pin description is organized in a small 
number of functional groups. Pin definitions and pro- 
tocols have been designed to minimize interface is- 
sues. In particular, they support the notion of inde- 
pendently controlled address and data phases. The 
primary host interface is synchronous in nature. 


In the following pin definition table if the signal name 
has (__) over it, the signal is in its active state when it 
has a low level. The signal direction column identi- 
fies output only signals as a continuous drive (QO), 
tristate (T/S), or open drain (O/D). All bi-directional 
(BI-D) signals have tri-stating outputs. 
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Pin Definition Table 
MnO eM Lge” ey | analog Neo 8 + a ey 
SYSTEM PINS 


The RESET INPUT forces 82489Dx to enter its initial state. The 82489DX 
Local Unit in turn asserts it PRST (Processor Reset) output. All tri-state 
outputs remain in high impedance until explicitly enabled. 


The EXTERNAL INTERRUPT ACKNOWLEDGE output is asserted (high) 
when an external interrupt controller (e.g., 8259) is expected to respond to 
the current INTA cycle. If deasserted (low), 82489DxX will respond, and the 
INTA cycle must not be delivered to the external controller. 


CLKIN HOUKIN > | By |: de CLOCK INPUT provides reference timing for most of the bus signals. 
TEST RESET is the JTAG compatible boundary scan TAP controller reset 
pin. A weak pull-up keeps the pin high if not driven. 
TEST CLOCK is the clock input for the JTAG compatible boundary scan 
controller and latches. 
TEST DATA INPUT is the test data input pin for the JTAG compatible 
boundary scan chain and TAP controller. A weak pull-up keeps this pin high 
if not driven. 

| TEST DATA OUTPUT is the test data output for the JTAG compatible 
boundary scan chain. 


TMS 54 TEST MODE SELECT is the test mode select pin for the JTAG boundary 
scan TAP controller. A weak pull-up keeps this pin high if not driven. 


TIMER PIN 


TMBASE The TIME BASE input provides a standard frequency that is only used by 
the 82489DxX timer and that is independent of the system clock. 


INTERRUPT PINS 


INTIN[15:0] | 82-97 These 16 INTERRUPT INPUT pins accept edge or level sensitive interrupt 
requests from I/O or other devices. The pin numbers are specified 
respectively. INTIN15 corresponds to pin number 82, INTIN14 corresponds 
to pin number 83 etc., and INTINO corresponds to pin number 97. These 
pins are active high. 


LINTIN[1] 80 Two LOCAL INTERRUPT INPUT pins accept edge or level sensitive 
LINTIN [0] 81 interrupt requests that can only be delivered to the connected processor. 


These pins are active high. 


REGISTER ACCESS PINS 


ADS 64 ADDRESS STROBE signal indicating the start of a bus cycle. 82489DX 
‘ does not commit to start the cycle internally until BUS GRANT is detected 
active. 
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Pin Definition Table (Continued) 


peymbol |"ela No. | Type | fw Sa Phnatien a" es 5 Ses Spain 


REGISTER ACCESS PINS (Continued) 


Bus cycle definition signals. Note that since the 82489DX registers can be 
mapped in either memory or |/O space, the M/IO pin is not used for register 
access cycles; it is only used to decode interrupt acknowledge cycles. 

82489DxX does not respond to code read cycles. 


The BUS GRANT input is optional and is used to indicate the address phase of 
a bus cycle in configurations where address timing cannot be inferred from 
ADS. This signal is really used as an address latch enable, but is named as it is 
to indicate that it can normally be connected to the Intel Cache Controller 
generated signal of the same name. Must be tied low if not used. 


DATA LATCH/ENABLE is optional and is used to indicate committing the data 
phase of a bus cycle in configurations where data timing cannot be inferred 
from other cycle timings. Must be tied low if not used. 


BI-D | The DATA BUS is for all register accesses and interrupt vectoring. 
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Pin Definition Table (Continued) 


| Syrobol | Pati ibe | vane = Frination: is er) oon eas 


REGISTER ACCESS PINS (Continued) 


One Data Parity pin for each byte on the data bus. EVEN parity is generated 
any time the data bus is driven by the 82489DxX. 


43 READY output indicates that the current bus cycle is complete. In the case of 
a read cycle, valid data and the return to inactive state after going active low 


may be delayed till DLE goes active. 


PROCESSOR PINS 


The PROCESSOR INTERRUPT OUTPUT indicates to the processor that one 
or more maskable interrupts are pending. This pin is tri-stated at reset, and has 
an internal pull-down resistor to prevent false signaling to the processor until 
the 82489DxX Local Unit is enabled and this pin is actively driven. 


The PROCESSOR RESET OUTPUT is asserted/de-asserted upon 82489DX 
reset, and also in response to ICC bus messages with “RESET” delivery 
mode. This pin should be used with care. 


The NON-MASKABLE INTERRUPT output is signaled in respone to ICC bus 
messages with “NMI” delivery mode. This pin is tri-stated at reset, and has an 
internal pull-down resistor to prevent false signaling to the processor until the 
Local Unit is enabled and this pin is actively driven. 


ICLK | 60 | 1 | The ICC BUS CLOCK input provides synchronous operation of the ICC bus. 


MBI[3:0] | 76-79 The four ICC BUS IN inputs are used for incoming ICC bus messages. In 
smaller configurations the ICC bus input and outputs may be tied directly 
together at the pins. Pin number for MBI3 is 76, MBI2 is 77, MBI1 is 78 and 
MBIO is 79. 

45 O/D 


The four ICC BUS OUT outputs are used for outgoing ICC bus messages. The 
48 current capacity is only 4 mA. So external bufferes will be needed. 
49 
51 
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Pin Definition Table (Continued) 
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RESERVED PINS 


34, 42 These pins MUST BE LEFT OPEN. 
Reserved 70, 72,15 ba Reserved by Intel. These pins should be strapped to Vcc. 
71, 18,20 ea: ey Reserved by Intel. These pins should be strapped to GND. 


POWER AND GROUND PINS 


Voc 1,32,69,98 | POWER | Nominally + 5V. These pins along with Vss and Vss; should be 
separately bypassed. 
Vocp 6, 15, 25,100, |} POWER | Nominally + 5V. These pins along with Vssp should be separately 
108, 117, 126 bypassed. 
VocPo 39, 46 POWER | Nominally + 5V. These pins along with Vsspo should be separately 
bypassed. 
Nominally OV. These pins along with Vcc should be separately 
bypassed. | 
10, 17, 22, 30, GND 
106, 113, 120, 
127, 132, 


Nominally OV. These pins along with Vccp should be separately 
bypassed. 
Vsspo 36, 40, 44, GND 
47, 50 
GND | Nominally OV. These pins along with Vcc should be separately 
bypassed. 
NOTE: 


Voc. Vocp and Vccpo should be of same voltage. Vss, Vssp, Vsspo and Vsg; should be OV. 


4.0 FUNCTIONAL DESCRIPTION 1/0 Unit 


As far as interrupt management is concerned, the The I/O Unit consists of a set of Interrupt Input pins, 
82489DX’s interrupt control function spans over two an Interrupt Redirection Table, and a message unit 
functional units, the I/O Unit of which there is one for sending and receiving messages from the ICC 
per |/O subsystem, and the Local Unit of which bus. The I/O Unit is where |/O devices inject their 
there is one per processor. 82489DX has one I/O interrupts, the |/O Unit selects the corresponding 
unit and one Local Unit in a single package. This entry in the Redirection Table and uses the informa- 
section takes a detailed look at both local and I/O tion in that entry to format an interrupt request 
Units. message. The message unit then broadcasts this 
message over the ICC bus. The content of the Redi- 
rection Table is under software control and is as- 
signed benign defaults upon reset. The masks in the 
Redirectional Table entries are set to 1 at hardware 
reset to disable the interrupts. 


ADVANCE INFORMATION 2-67 


82489DX intel : 


DATA/ADDR 


FHM eB Pt Bde AC TG ETERS Wd Bs UL cea cre ee acelin | 


— 


eae Bus 
Se send eae receive 


lls. | 


EDGE SENSE 
bCewees ws = = we 


INTERRUPT INPUT LINES 


| 
Table 


1/0 UNIT ID REG 1/O UNIT VERSION REG 


[a 


J 
a 
| 
& 
LJ 
i 
I 
H 
t 
] 
J 
| 
: 
val 


290446-3 


Figure 2. 82489DX I/O Unit Block Diagram 


Local Unit mode of the interrupt, zero, one or more units can 

accept an interrupt. A Local Unit accepts an inter- 
Interrupt Management of the Local Unit is responsi- rupt only if it will deliver the interrupt to its processor. 
ble for local interrupt sources, interrupt acceptance, Accepting an interrupt is purely an inter-82489DX 
dispensing interrupts to the processor, and sending matter; dispensing an interrupt to the local proces- 


inter-processor interrupts. Depending on the delivery —_—sor only involves a 82489DxX and its local processor. 
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Figure 3. 82489DX Local Unit Block Diagram 
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5.0 INTERRUPT CONTROL 
MECHANISM 


This section describes briefly the interrupt control 
mechanism in the 82489DX. 


5.1 Interrupts 


The interrupt control function of all 82489DXs are 
collectively responsible for delivering interrupts from 
interrupt sources to interrupt destinations in the mul- 
tiprocossor system. When a processor accepts an 
interrupt, it uses the vector to locate the entry point 
of the handler in its interrupt table. The 82489DX 
architecture allows for 16 possible interrupt priori- 
ties; zero being the lowest priority and 15 being the 
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highest. Priority of interrupt A “is higher than” the 
priority of interrupt B if servicing A is more urgent 
than servicing B. An interrupt’s priority is implied by 
its vector; namely priority = vector/16. 


With 256 vectors and 16 different priorities, this im- 
plies that 16 different interrupt vectors can share a 
single interrupt priority. 


TOTAL ALLOWED INTERRUPT VECTORS 


Out of 256 vectors, interrupt vectors 0 to 15 should 
not be used in the 82489DX. Only 240 interrupt vec- 
tors (vectors from 16 to 255) are supported in the 
82489DX. 
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Figure 4. 1/O Units and Local Units 
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INTERRUPT SOURCES © 


Interrupts are generated by a number of different in- 
terrupt sources in the system. 


Possible interrupt sources are: 


e Externally connected (I/O) devices. Interrupts 
from these external sources manifest themselves 
as edges or levels on interrupt input pins and can 
be redirected to any processor. 


e Locally connected devices. These originate as 
edges or levels on interrupt pins, but they are al 
ways directed to the local processor only. 


e 82489DX timer generated interrupts. Like locally 
connected devices, 82489DX timer can only in- 
terrupt its local processor. 


e Processors. A processor can interrupt any indi- 
vidual processor or sets of processors. This sup- 
ports software self-interrupts, preemptive sched- 
uling, TLB flushing, and interrupt forwarding. A 
processor generates interrupts by writing to the 
interrupt command register in its Local Unit. 


INTERRUPT DESTINATIONS 


|/O Units can only source interrupts whereas Local 
Units can both source and accept interrupts, so 
whenever “interrupt destination” is discussed, it is 
implied that the Local Unit is the destination of the 
interrupt. In physical mode the destination processor 
is specified by a unique 8-bit 82489DX local ID. Only 
a single destination or a broadcast to all (LOCAL ID 
of all ones) can be specified in physical destination 
mode. 


In logical mode destinations are specified using a 
32-bit destination field. All Local Units contain a 
32-bit Logical Destination register against which the 
destination field of the interrupt is matched to deter- 
mine if the receiver is being targeted by the interrupt. 
An additional 32-bit Destination Format register in 
each Local Unit enables the logical mode address- 


ing. 


INTERRUPT DELIVERY 


The description of interrupt delivery makes frequent 
use of the following terms: 


e Each processor has a processor priority that re- 
flects the relative importance of the code the 
processor is currently executing. This code can 
be part of a process or thread, or can be an inter- 
rupt handler. A processor’s priority fluctuates as a 
processor switches threads, a thread or handler 
raises and lowers its priority level to mask out 
interrupt, and the processor enters an interrupt 
handler and returns from an interrupt handler to 
previously interrupted activity. 
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e A processor is lowest priority within a given group 
of processors if its processor priority is the lowest 
of all processors in the group. Note that more 
than one processor can be the lowest priority in a 
given group. 

e A processor is the focus of an interrupt if it is 
currently servicing that interrupt, or if it currently 
has a request pending for the interrupt. 


Interrupt delivery begins with an interrupt source in- 
jecting its interrupt into the interrupt system at one of 
the 82489DX. Delivery is complete only when the 
servicing processor tells its 82489DX Local Unit it is 
complete by issuing an end-of-interrupt (EOI) com- 
mand to its 82489DX Local Unit. Only then has all 
(relevant) internal state regarding that occurrence of 
the interrupt been erased. The interrupt system 
guarantees exactly-once delivery semantics of inter- 
rupts to the specified destinations. Exactly-once 
guaranteed delivery implies a number of things: 


e The interrupt system never rejects interrupts; it 
never NAKs interrupt injection, interrupts are nev- 
er lost, and the same interrupt (occurrence) is 
never delivered more than once. 


Clearly a single edge interrupt or level interrupt 
counts as a single occurrence of an interrupt. In uni- 
processor systems, an occurrence of an interrupt 
that is already pending (IRR) cannot be distin- 
guished from the previous occurrence. All occur- 
rences are recorded in the same IRR bit. They are 
therefore treated as “the same’ interrupt occur- 
rence. 


For lowest-priority delivery mode, by delivering an 
interrupt first to its focus processor (if it currently has 
one), the identical behavior can be achieved in a MP 
(Multiprocessor) system. If an interrupt has a focus 
processor then the interrupt will be delivered to the 
interrupt’s focus processor independent of priority 
information. This means that even if there is a lower 
priority processor compared to the focus processor, 
the interrupt still gets delivered to the focus proces- 
sor. 


Each edge occurring on an edge triggered interrupt 
input pin is clearly a one-shot event; each occur- 
rence of an edge is delivered. An active level on a 
level triggered interrupt input pin represents more of 
a “continuous event’. Repeatedly broadcasting an 
interrupt message while the level is active would 
cause flooding of the ICC bus, and in effect trans- 
mits very little useful information since the same 
processor (the focus) would have to be the target. 


Instead, for level triggered interrupts the 82489DX 
merely recreate the state of the interrupt input pin at 
the destination . The source 82489DX accomplishes 
this by tracking the state of the appropriate destina- 
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tion 82489DX’s Interrupt Request Register (or pend- 
ing bit) and only sending inter-82489DX messages 
when the state of the interrupt input pin and the des- 
tination’s interrupt request enter a disagreement. 
Unlike edge triggered interrupts, when a level inter- 
rupt goes into service, the interrupt request at the 
servicing 82489DX is not automatically removed. If 
the handler of a level sensitive interrupt executes an 
EOI then that interrupt will immediately be raised to 
the processor again, unless the processor has ex- 
plicitly raised its task priority, or the source of that 
interrupt has been removed. 


5.2 Interrupt Redirection 


This section specifically talks about how a processor 
is picked during interrupt delivery. The 82489DX 
supports two modes for selecting the destination 
processor: Fixed and Lowest Priority. 


e Fixed Dellvery Mode ' 
In fixed delivery mode, the interrupt is uncondi- 
tionally delivered to all local 82489DXs _ that 
match in the destination information supplied with 
the interrupt. Note that for |/O device interrupts 
typically only a single 82489DX would be listed in 
the destination. Priority and focus information are 
ignored. If the priority of a destination processor 
equal to or higher than the priority of the interrupt, 
then the interrupt is held pending locally in the 
destination processor’s Local Unit, until the proc- 
essor priority becomes low enough at which time 
the interrupt is dispensed to the processor. More 
than one processor can be the destination in 
fixed-delivery mode. 


e Lowest Priority Dellvery Mode 

| Under the lowest priority delivery mode, the proc- 
essor to handle the interrupt is the one in the 
specified destination with the lowest processor 
priority value. If more than one processor is at the 
lowest priority, then a unique arbitration ID is 
used to break ties. For lowest priority dynamic 
delivery, the interrupt will always be taken by its 
focus processor if it has one. The lowest priority 
delivery method assures minimum interruption of 
high priority tasks. Since each Local Unit only 
knows its own processor priority, determining the 
lowest priority processor is done by arbitration on 
the ICC bus. Only one processor can be the des- 
tination in lowest-priority delivery mode. 


INTER-82489DX COMMUNICATION 


All |/O and Local Units communicate during interrupt 
delivery. Interrupt information is exchanged between 
different units on a dedicated five wire ICC bus in the 
form of broadcast messages. A 82489DX Unit’s 8-bit 
ID is used as its name for the purpose of using the 
ICC bus, and all 82489DX units using one ICC bus 
should be assigned a different ID. The Arbitration 
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ID of the Local Units used to resolve tles during low- 


est priority arbitration is also derived from the Local 
Unit’s ID. 


6.0 82489DX LOCAL UNIT 
REGISTERS DESCRIPTION 


6.1 Local Unit ID Register 


Each 82489DX Local Unit has a register that con- 
tains the Local Unit’s 8-bit ID. The Local Unit ID 
serves as a physical name of the 82489DX Local 
Unit. It can be used in specifying destination infor- 
mation and is also used for accessing the ICC bus. 
Eight address lines A[10]-A[3] are sampled on ev- 
ery clock edge while RESET is asserted. The last 
sample remains in the Local Unit ID register after 
reset. Alternatively, the ID can be loaded with a reg- 
ister write as part of software initialization. The Local 
Unit ID is read-write by software. 


Bits [91.24] | Bits 23.0) 


82489DX Local Unit ID Register 


Bits [31..24] Local Unit ID: The Local Unit ID 
serves as the physical “name” of the 
Local Unit used for addressing the 
82489DxX in physical destination mode 
and for the ICC bus usage. In a sys- 
tem with say four 82489DxX, there are 
4 Local Units and 4 I/O Units. All the 8 
units should be assigned different IDs. 
For future compatibility use only IDs 
from 0 to 14. 


Bits [23..0] Bits [23..0] are Reserved. They 
should be written with 0. 


6.2 Destination Format Register 


Interrupt Destination can be either addressed physi- 
cally or logically. When the interrupt message ad- 
dresses the destination physically, each 82489DxX in 
the ICC bus compares the address with its own unit 
ID. If the message is a broadcast type then every 
Local Unit accepts the interrupt. 


When the message addresses the destination using 
logical addressing scheme each Local Unit in the 
ICC bus compares the logical address in the inter- 
rupt message with its own Logical Destination Reg- 
ister. If there is a bit match (i.e., if at least one of the 
corresponding pair of bits match) this local unit is 
selected for delivery. : 
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All the 32 bits of Destination Format Register of all 


82489DX connected in the ICC bus should be writ- 
ten with “1” to enable the addressing scheme. 


Destination Format Register 
Logical Destination Register 


Bits [31:0] 


For future compatibility, use only bits 31-24 of Logi- 
cal Destination Register. For binary compatibility, it 
is strongly recommended that 82489DX software 
use only 8 MSB of Logical Destination Register. 


6.3 Local Interrupt Vector Table 
Registers 


The Redirection Table serves to steer interrupts that 
originate in the |/O subsystems to the processors. 
The Local Vector Table is its equivalent for inter- 
rupts that are restricted to only the local processor. 
The Local Vector Table contains three 32-bit regis- 
ters. Register 0 corresponds to the timer, registers 1 
and 2 correspond to local interrupt input pins, 
LINTINO and LINTIN1. 


The format of both the Local 0 and Local 1 interrupt 
vector tables are identical. The following register de- 
scription talks about both Local Interrupts 0,1 vec- 
tors. 


Local Interrupts 0, 1 Interrupt Vectors 


Vector: [Bits 7-0] 


This is the vector to use when gener- 
ating an interrupt for this entry. 


Delivery Mode: [Bits 10-8] 
000: Fixed 
001: <reserved> 
010: <reserved> 
011: <reserved> 
100: NMI 
101: <reserved> 


Local INTO Vector Table 


eisia1i7] | atte | atts | site | stra | ete | ett | sitstioe) | Bits 7.0 


Local INT1 Vector Table 


82489DX 


110: <reserved> 
111: ExtiINT 


000: (fixed) means deliver the signal 
on the INT pin of the local proc- 
essor. Trigger mode for ‘‘fixed”’ 
Delivery Mode can be edge or 
level. 


100: (NMI) means deliver the signal 
on the NMI pin. of the local proc- 
essor. Vector information is ig- 
nored. A Delivery Mode equal to 
“NMI” requires a “level” trig- 
gered mode. 


111: (ExtINTA) means deliver the sig- 
nal to the INT pin of the local 
processor as an interrupt that 
originated in an externally con-— 
nected (8259A compatible) in- 
terrupt controller. ExtINTA pin is 
as- serted also. The INTA cycle 
that corresponds to this ExtiN- 
TA delivery, should be routed to 
the external controller that is ex- 
pected to supply the vector. A 
delivery mode of ExtiNT re- 
quires an edge trigger mode. 
(See the section on compatiblity 
for more details.) 


Bit 11 is Reserved. It should be writ- 
ten 0. 


Bit 11: 


Delivery Status: [Bit 12] 


This field is software-read only. Soft- 
ware writes to this field (as part of a 
32-bit word) have no effect on this 
bit. Delivery status is a 1-bit field that 
contains the current status of the de- 
livery of this interrupt. Two states are 
defined. 


0: (idle) means that there is currently 
no activity for this interrupt. 


1: (send pending) indicates that the 
interrupt has been injected, but its 
delivery is temporarily held up by 
the recently injected interrupts that 
are in the process of being deliv- 
ered. 


Bits [91:17], |ti2 | atti |) sitstio8) | Bits (7-0) 


Figure 5. Local Vector Table 
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Bit 13: Bit 13 is Reserved. It should be written 


0. 


Remote IRR: [Bit 14] 


This bit is used for level triggered local 
interrupts; its meaning is undefined for 
edge triggered interrupts. Remote IRR 
mirrors the interrupt’s IRR bit of this 
local unit. Remote IRR is software 
read-only; software writes to this bit do 
not affect it. 


Trigger Mode: [Bit 15] 


The Trigger Mode field indicates the 
type of signal on the local interrupt pin 
that triggers an interrupt. 


0: indicates edge sensitive, 
1: indicates level sensitive. 


Only the local interrupt pins can be 
programmed as edge or. level trig- 
gered. Timer interrupts are always 
treated as edges. 


MASK: [Bit 16] | 
0: enables injection of the interrupt, 
1: masks injection of the interrupt. 


Bits [31:17] Bits [31:17] are Reserved. Should be 
written 0. 


6.4 Inter-Processor Interrupt 
Registers 


A processor generates inter-processor interrupts by 
writing to the Interrupt Command Register in its 
82489DX Local Unit. Conceptually, this can be 
thought of as the processor providing the interrupt’s 
Redirection Table Entry on the fly. Not surprisingly, 
the layout of the Interrupt Command Register re- 
sembles that of an entry in the Redirection Table. 
Note that the format of this register allows a proces- 
sor to generate any interrupt. A processor may use 
this to forward device interrupts originally accepted 
by it to other processors. 


All fields of the Interrupt Command Register are 
read-write by software with the exception of the De- 
livery Status field which is read-only. Writing to the 
32-bit word that contains the interrupt vector causes 
the interrupt message to be sent. 


Interrupt Command Register [31:0] 
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The vector identifies the interrupt being 
sent. If the Delivery Mode is “Remote 
Read’, then the Vector field contains 
the address of the register to be read in 
the remote 82489DX’s Local Unit. The 
addresses are listed in the section dis- 
cussing 82489DX Local Unit register 
summary. For example, for ID register, 
remote read address of 02 should be 
specified in vector field. 


Delivery Mode: [Bits 10-8] 


The Delivery Mode is a 3-bit field that 
specifies how the 82489Dx listed in the 
destination field (bits 63:32) should act 
upon reception of this signal. Note that 
certain Delivery Modes will only operate 
as intended when used in conjunction 
with a specific Trigger Mode. These re- 
strictions are indicated for each Delivery 
Mode. 


000: (Fixed) means deliver the signal on 
the INT pin of all processors listed 
in the destination. Trigger Mode 
for “fixed’’ Delivery Mode can be 
edge or level. 


001: (Lowest Priority) means deliver the 
signal on the INT pin of the proc- 
essor that is executing at the lower 
priority among all the processors 
listed in the specified destination; 
Trigger Mode for “lowest priority” 
Delivery Mode can be edge or lev- 
el. 


010: Intel Reserved. Should not be 
used. 


011: (Remote Read) is a request to a 
remote 82489DX Local Unit to 
send the value of one of its regis- 
ters over the ICC bus. The register 
is selected by providing its address 
in the Vector field. The register val- 
ue is latched by the requesting 
82489DX and stored in the Re- 
mote Register where it can be 
read by the local processor. A De- 
livery Mode of “Remote Read” re- 
quires an “edge” Trigger Mode. 
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100: (NMI) means deliver the signal on 
the NMI pin of all processors list- 
ed in the destination, vector infor- 
mation is ignored. A Delivery 
Mode equal to “NMI” requires a 
“level” Trigger Mode. 


101: (Reset) means deliver the signal 
to all local units listed in the desti- 
nation. The destination local unit 
will assert/deassert its PRST out- 
put pin. All addressed 82489DX 
Local Units will assume their re- 

' set state but preserve their ID. 
One side effect of an ICC bus 
message with Delivery Mode 
equal to “Reset” that results in a 
deassert of reset is that all Local 
Units (whether listed in the desti- 
nation or not) will reset their low- 
est-priority tie breaker arbitration 
ID to their Local Unit ID (see the 
section on the ICC bus for de- 
tails). A Delivery Mode of ‘“Re- 
set” requires a “level” Trigger 
Mode. “RESET” should not be 
used with ‘‘self” or ‘‘all incl self’ 
Shorthand mode since it will 
leave the system in non-recover- 
able reset state. If “RESET” is 
used with ‘‘all excl self’ mode 
software should make sure that 
only one CPU executes this in- 
struction in a MP system. 


110: Intel Reserved. Should not be 
used. 


111: Intel Reserved. Should not be 
used 


Destination Mode: [Bit 11] 


This field determines the interpretation 
of the Destination field. 


0: (Physical Mode): in Physical Mode, 
a destination 82489DxX is identified 
by its Local Unit ID. Bits 56 through 
63 (8 MSB of the destination field) 
specify the 8-bit 82489DX Local 
Unit ID. | 


1: (Logical Mode): in Logical Mode, 
destinations are identified by match- 
ing on Logical Destination under the 
control of the Destination Format 
Register in each Local 82489DX. 
The 32-bit Destination field is the 
logical destination. | 
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Delivery Status: [Bit 12] 


Bit 13: 


Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 


0: (Idle) means that there is currently 
no activity for this interrupt; 


1: (Send Pending) indicates that the 
interrupt has been injected, but its 
delivery is temporarily held up by 
other recently injected interrupts 
that are in the process of being de- 
livered; 


Delivery Status is software read-only; 
software writes to this field (as part of 
a 32-bit word) do not affect this bit. 
Software can read to find out if the 
current interrupt has been sent, and 
the Interrupt Command Register is 
available to send the next interrupt. If 
the Interrupt Command Register is ov- 
erwritten before the Delivery Status is 
“Idle”, then the destiny of that inter- 
rupt is undefined; i.e., the interrupt may 
have been lost. 


Bit 13 is Reserved . Should be written 
0. 


Level: [Bit 14] 


Software can use this bit in conjunc- 
tion with the Trigger Mode bit when is- 
suing an inter-processor interrupt to 
simulate assertion/deassertion of lev- 
el sensitive interrupts. 


To assert: Trigger mode = 1 and Lev- 
el = 1. 


To deassert: Trigger mode = 1 and 
Level = 0. 


For example, a message with Delivery 
Mode of “Reset’’, a Trigger Mode of 
“Level”, and Level bit of 0 deasserts 
Reset to the processor of the ad- 
dressed 82489 DX Local Unit(s). As a 
side effect, this will also cause all 
82489DxX to reset their Arbitration ID 
to their unit ID. (The Arb ID is used for 
tie breaking in lowest priority arbitra- 
tion.) 


Trigger Mode: [Bit 15] 


Software can use this in conjunction 
with Level Assert/Deassert to gener- 
ate interrupts that behave as edges or 
levels. 


0: Edge 
1: Level 
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Remote Read Status: [Bits 17,16] 


This field indicates the status of the 
data contained in the Remote Read 
register. This field is read-only to soft- 
ware. Whenever software writes to the 
Interrupt Command Register using De- 
livery Mode “Remote Read” the Re- 
mote Read status becomes “‘in-prog- 
ress” (waiting for the remote data to ar- 
rive). The remote 82489DxX Local Unit is 
expected to respond in a fixed amount 
of ICC bus cycles. If the remote 
82489DX Local Unit is unable to do so, 
then the Remote Read status becomes 
‘Invalid’. If successful, the Remote 
Read status resolves to ‘Valid’. Soft- 
ware should poll this field to determine 
completion and success of the Remote 
Read command. 


00: (invalid): the content of the Remote 
Read Register is invalid. This is the 
case after a Remote Read com- 
mand issued and the remote 
82489DX Local Unit was unable to 
deliver the Register content in time. 


01: (in progress): a Remote Read com- 
mand has been issued and this 
82489DxX is waiting for the data to 
arrive from the remote 82489DX Lo- 
cal Unit. 


10: (valid): the most recent Remote 
Read command has completed and 
the remote register content in the 
Remote Read Register is valid. 


11: reserved. 


Destination Shorthand: [Bits 19,18] 
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This field indicates whether a shorthand 
notation is used to specify the destina- 
tion of the interrupt and if so, which 
shorthand is used. Destination Short- 
hands do no use the 32-bit Destination 
field, and can be sent by software with 
a single 32-bit write to the 82489DX’s 
Interrupt Command Register. Short- 
hands are defined for the following 
common cases: software self interrupt, 
interrupt to all processors in the system 
including the sender, interrupts to all 
processors in the system excluding the 
sender. 


00: (dest field): means that no short- 
hand is used. The destination is 
specified in the 32-bit Destination 
field in the second word (bits 32 to 
63) of the Interrupt Command Reg- 
ister. 
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01: (self): means that the current 
82489DX Local Unit is the single 
destination of the interrupt. This is 
useful for software interrupts. The 
Destination field in the Interrupt 
Command Register is ignored. RE- 
SET Delivery mode should not be 
used with self destination. Only 
FIXED delivery mode should be 
used with SELF. 


10: (all incl self): means that the inter- 
rupt is to be sent to “all’’ proces- 
sors in the system including the 
processor sending the _ interrupt. 
The 82489DxX will broadcast a mes- 
sage with destination unit ID field 
set to all ones. RESET assert Deliv- 
ery mode should not be used with 
“all incl self” destination. 


11: (ail excl self): means that the inter- 
rupt is to be sent to “all” proces- 
sors in the system with the excep- 
tion of the processor sending the 
interrupt. The 82489DX will broad- 
cast a message with destination 
unit ID field set to all ones. All-excl- 
self is useful during selection of a 
boot processor (init) and also for 
TLB flush where “self” is flushed 
using the processor flush instruc- 
tion. Only one CPU in a MP system 
should execute “‘all excl self” desti- 
nation if used with RESET Delivery 
mode. 


Bits [31:20] Bits [31:20] are Reserved. They should 
be written 0. 


Interrupt Command Register [63:32] 


Destination: [Bits 63-32] 


This field is only used when the Desti- 
nation Shorthand is set to “Dest Field’. 
lf Destination Mode is Physical Mode, 
then the 8 MSB contain a Destination 
unit ID. If Logical Mode, the full 32-bit 
Destination field contains the logical ad- 
dress. The enabling is done by Destina- 
tion Format Register. 


6.5 IRR, ISR, TMR Registers 


INTERRUPT ACCEPTANCE 


All 82489DX Local Units listen to all messages sent 
over the ICC bus. For each message, the local unit 
first checks if it belongs to the destination in the 
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message. It does this by matching the 32-bit Desti- 
nation field in the message against its logical Desti- 
nation Register, if the message addresses in logical 
mode, and against its physical ID, if the message 


addresses in physical mode. All 82489DX Local 
Units that match are said to ‘‘belong to the group”. 


Each 82489DX Local Unit contains three 256-bit 
registers that play a role in the acceptance of inter- 
rupts and in dispensing accepted interrupts to the 
local processor. Each of these registers is a bit array 
where bit position i tracks information about the in- 
terrupt with vector i. These bits track information 
about the (PINT) maskable interrupts only. They are 
not relevant for NMI, RESET or ExtINT type of inter- 
rupts. The Interrupt Request Register (IRR) contains 
the interrupts accepted by this 82489DX Local Unit 
but not yet dispensed to the processor. The In Serv- 
ice Register (ISR) contains the interrupts that are 
currently in service by the processor, i.e., the inter- 
rupts that have been dispensed to the processor but 
for which the processor has not yet signaled the 
End-Of-Interrupt. 


Note that the 82489DX’s IRR and ISR registers have 
the same meaning and operation as in the 8259A in 
fully nested/non-specific EOI mode. Note also that 
these registers play no role in providing 8259A com- 
patibility. Compatibility is handled by making an ex- 
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ternal 8259A-style controller directly visible to the 
processor and having the 82489DX become trans- 
parent. 


Each interrupt has a vector associated with it, which 
determines the bit position, and hence the priority for 
the interrupt. When an interrupt is being serviced, all 
equal or lower priority interrupts are automatically 
masked by the 82489DX Local Unit. 


The Trigger Mode Register (TMR) indicates for each 
interrupt whether the interrupt is edge or level. This 
information is transmitted with each 82489DxX inter- 
rupt request message and reflects the Trigger Mode 
bit in the interrupt’s Redirection Table entry. If an 
interrupt goes in service and the TMR bit is 0 (edge), 
then the interrupt’s IRR bit is cleared at the same 
time the ISR bit is set. If the TMR bit is 1 (level), then 
the IRR bit is not cleared when the interrupt goes in 
service. In the latter case, the IRR bit mirrors the 
state of the interrupt’s input pin. 


The following diagram shows 82489DX operation 
with devices A and B sharing a level triggered inter- 
rupt input. The diagram illustrates how Remote IRR, 
and the IRR bit at the destination 82489DX track the 
state of INTIN. It also illustrates how an EOI is fol- 
lowed immediately be re-raising the interrupt as long 
as the INTIN is still asserted by some device. 


4 


/ \ XOR of INTIN and Remote IRR / \ 
\. pe "level deassert” 


message 


Remote IRR 


IRR at dest Unit 


INT/EOI 


message 
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Figure 6. Interrupt Sharing 
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ISR, IRR, and TMR are read-only by software. Each 
of these 256-bit registers is accessed as four sepa- 
rate 32-bit registers. Note that there is no general 
Interrupt Mask Register (IMR) as in the 8259A. The 
processor masks interrupts temporarily by writing to 
the Icoal unit’s Task Priority Register (described 
shortly). 


ISR [Interrupt Status Register] 


IRR [Interrupt Request Register] 


Bits [255:0] 


TMR [Trigger Mode Register] 


Figure 7. ISR, IRR, and TMR 


TMR (Trigger Mode Register): 


lf O [edge triggered] the corresponding 
IRR_ bit is automatically cleared when 
interrupt service starts. If 1[level trig- 
gered] this is not the case; instead, the 
source 82489DX must explicitly request 
the IRR bit be cleared (upon deassert 
of the interrupt input pin or upon send- 
ing an appropriate interprocessor inter- 
rupt). Upon acceptance of an interrupt, 
the TMR bit is cleared for edge trig- 
gered interrupts and set for level trig- 
gered interrupts. This information is car- 
ried in the accepted interrupt message. 
The source 82489DX 1|/O unit also 
tracks the state of the destination unit’s 
IRR bit (Remote IRR bit in the Redirec- 
tion Table). When a level triggered in- 
terrupt input is deasserted, the source 
82489DX |/O unit detects the discrep- 
ancy between the input pin state and 
the Remote IRR, and automatically 
sends a message telling the destination 
82489DxX to clear IRR for the interrupt. 
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IRR (Interrupt Request Register): 


It contains the active interrupt requests 
that have been accepted, but not yet 
dispensed by this 82489DX Local Unit. 
A bit in IRR is set when the 82489DX 
Local Unit accepts the interrupt. When 
TMR is 0, it is cleared when the inter- 
rupt is serviced; when TMR is 1, it is 
cleared when the 82489DX Local Unit 
receives a message to clear it. 


ISR (In Service Register): 


It marks the interrupts that have been 
delivered to the processor, but that 
have not been fully serviced in that an 
End-Of-interrupt has not yet been re- 
ceived. The ISR register reflects the 
current state of the processor’s inter- 
rupt stack. 


ACCEPTANCE MECHANISM 


Interrupt acceptance proceeds as follows. If the de- 
livery mode is Fixed, then each unit in the destina- 
tion group unconditionally accepts the interrupt mes- 
sage and sets the interrupt’s IRR bit. If the delivery 
mode is Lowest Priority, then each processor in the 
group first checks if it is currently the focus of the 
interrupt by checking its ISR and IRR. If an 82489DX 
finds one of these bits set for the incoming interrupt, 
then that 82489DX Local Unit accepts the interrupt 
independent of priority, and “signals” the other 
82489DX Local Units to abort the priority arbitration. 
This avoids multiple delivery of a same interrupt oc- 
currence to different processors, consistent with in- 
terrupt delivery semantics in uniprocessor systems 
as described above. If a message is to be delivered 
for NMI or Reset, then all 82489DX Local Units list- 
ed in the destination unconditionally assert/deassert 
the corresponding output pin. ISR, IRR, etc. are by- 
passed for NMI or reset and vector information is 
undefined. 
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The acceptance decision process is illustrated in the flow chart below. 


Wait to receive 
ICC message 
, Belong to 
Discard Message Bestination 


Delivery Mode Set Interrupt's IRR 


Lowest Pty 


Am | 
the Winner 
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Figure 8. interrupt Acceptance Flow Chart 
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6.6 Tracking Processor Priority 


Each 82489DX Local Unit should be programmed 
with task priority so that it can mask interrupts 
that are less priority than that of the processor tem- 
porarily. 


Task switching and task priority changes are the re- 
sult of explicit software action. The operating system 
may define a number of task scheduling classes. Ex- 
amples are an idle class, a background class, a fore- 
ground class, and a time critical class. Alternatively, 
different classes can be assigned to user code 
versus system code. If tasks in different classes are 
executing when an interrupt comes in, then it may be 
advantageous to interrupt the processor currently 
running the task in the least important class. Clearly, 
if one processor is idle while others are doing work, 
the idle processor would be the obvious target for 
servicing the interrupt. This implies that there is use 
in defining priority levels below all interrupt levels 
that can participate in lowest priority delivery selec- 
tion. 


At times, the operating system may need to block 
out interrupts from being serviced. For example, to 
synchronize access to a shared data structure be- 
tween a device driver and its interrupt handler the 
driver raises it priority to equal or higher than the 
interrupt’s priority. 


The local 82489DX supports this via its Task Priority 
Register (the 8259A supports this via the interrupt 
mask register (IMR).) Software that wants to make 
use of this is required to inform its 82489DX Local 
Unit of the prioity change by updating the Task Prior- 
ity Register. The Task Priority field is 8 bits providing 
up to 256 distinct priorities. The 4 MSB of this regis- 
ter correspond to the 16 interrupt priorities while the 
4 LSB provide more precision. Priorities are best 
noted as x:y, where x is the value of the 4 MSB and y 
is the value of the 4 LSB. For example, Task Priority 
Register values 0:y with O < y < 15 (and 0 in the 4 
MSB) can be used to represent the priorities of the 
task scheduling classes described above (y = 0 for 
idle; y = 1 for background; etc.). Except for inter- 
rupts with vectors 0 through 15 (which are often pre- 
defined by the processor) which all have priority 0:0, 
the priorities of all other interrupts and their handlers 
is x:0 with 1 < x < 15 and is above the base task 
priorities O:y. 


For example, interrupt vector 123 has priority 7:0 


(123/16 = 7) and can be masked by any task that 
raises its priority to a value equal or higher than 7:0. 
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82489DX uses Task priority register for the purpose 
of masking the interrupts. The task priority register 
should be programmed with a priority value to speci- 
fiy the priority of task the processor is executing. 


82489DX masks any interrupts of lower or equal pri- 
ority when compared with task priority. 


When task priority register is programmed with the 
priority 15, all the interrupts are masked. When task 
priority register is programmed with priority level X, 
by definition, all the interrupts of priority X and below 
X will be masked. When task priority register is pro- 
grammed with the priority 0 then all the interrupts 
above priority 0 are allowed to interrupt the proces- 
sor. This means that when task priority register is 
programmed even with the lowest value, i.e., 0, inter- 
rupts of priority 0 will be masked. So only 240 inter- 
rupt vectors should be used in 82489DxX. Interrupt 
vecotrs from 0 to 15 should not be used. 


The first priority value computed is the maximum of: 
e Task Priority (4msb : 4lsb) and 


e the priority of the highest order ISR bit set ((vec- 
tor/16) :0). 


The value is used to determine whether or not a 
pending interrupt can be dispensed to the proces- 
sor. 


The second priority value computed is the maximum 
of: 
e Task Priority (4msb : 4lsb), and 


e the priority of the highest order ISR bit set ((vec- 
tor/16) :0), and 


e the priority of the highest order IRR bit set ((vec- 
tor/16) :0). 


This value is used during arbitration as part of low- 
est-priority delivery. 


Task Priority Register 


From the information in the Task Priority Register 
and the priority information derived from the ISR and 
IRR register, the 82489DX Local Unit computes two 
additional priority values: 


Bits [31:8] Bits [31:8] are Reserved. They should be 
written 0. 


Bits [7:0] Task Priority 


Bits [7:0] are used to specify the task pri- 
ority. 
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6.7 Dispensing Interrupts 


DISPENSING INTERRUPTS TO THE LOCAL 
PROCESSOR 


Once a 82489DX Local Unit accepts an interrupt, it 
guarantees delivery of the interrupt to its local proc- 
essor. (This part of the 82489DX functions similarly 
to an 8259A.) Dispensing a maskable interrupt to the 
local processor begins when the Local Unit asserts 
the INT pin of its processor. If the processor has 
interrupts enabled, it will respond by issuing an INTA 
cycle. This causes the Local Unit to freeze its inter- 
nal priority state and release the 8-bit vector of the 
highest priority interrupt on the data bus where it is 
read by the processor and used to find the handler’s 
entry point. The INT/INTA protocol also causes the 
interrupt’s ISR bit to be set. The corresponding bit in 
the IRR register is only cleared if the TMR register 
indicates it should do so (edge triggered interrupts), 
otherwise (level triggered interrupts), IRR is only 
cleared when the Interrupt Input Pin is deasserted. 


6.8 Spurious Interrupt Vector Register 


SPURIOUS INTERRUPT 


Note that it can happen that a level-triggered inter- 
rupt is deasserted right before its INTA cycle. In that 
case, all IRR bits may be clear and the prioritizer 
may not find a vector to give to the processor. To 
satisfy the processor’s demand for a vector, instead, 
the 82489DxX will return a spurious interrupt vector 
instead. 


A similar situation may occur when the processor 
raises its Task Priority at or above the level of the 
interrupt for which the Processor INT pin is currently 
being asserted. When the INTA cycle is issued, the 
interrupt that was to be dispensed has become 
masked (masked but remembered). 


Dispensing the spurious interrupt vector does not af- 
fect the ISR register, so the handler for this vector 
should just return without EOI. If the vector is shared 
with a valid interrupt, then the handler can read the 
vector’s bit in the ISR register to check if it is in- 
voked for the valid interrupt (ISR bit set) or not (ISR 
bit clear). Given the range of 240 vectors, overload- 
ing the spurious interrupt with a valid interrupt is not 
expected to be common practice. The spurious in- 
terrupt vector to be used by a Local Unit is program- 
mable via the Spurious Interrupt Vector Register. 


‘UNIT ENABLE 


It is possible that Local Units exist in the system that 
do not have a processor to which to dispense inter- 
rupts. The only danger this represents in the system 
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is that if any interrupt is broadcast to all processors 
using lowest priority delivery mode when all proces- 
sors are at the lowest priority, there is a chance that 
a Local Unit without the processor may accept the 
interrupt if this Local Unit happens to have the low- 
est Arb ID at the time. To prevent this from happen- 
ing, all Local Units initialize in the disabled state and 
must be explicitly enabled before they can either 
start accepting or transmitting messages from the 
ICC bus. A disabled 82489DX Local Unit only re- 
sponds to messages with Delivery Modes set to 
‘Reset’. Reset deassert messages should be sent 
in Physical Destination mode using the target’s Lo- 
cal Unit ID since the logical destination information 
in the Icoal units is undefined (all zeroes) when the 
82489DX comes out of Reset. 
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Figure 9. Spurious Interrupt Vector Register 
Bits [31 .. 9] Reserved Bits. Should be written 0. 


Unit Enable: [Bit 8] 


0: When a 0 is written to this bit, this 
Local Unit gets disabled with regard 
to responding to messages sent as 
well as transmitting on the ICC bus. It 
only responds to messages with De- 
livery Mode set to “Reset’’. Reading 
a 0 at this bit indicates that the unit is 
disabled. 


1: When a 1 is written to this bit, the 
current Local Unit is enabled for both 
transmitting and receiving unit mes- 
sages. Reading a 1 at this bit indi- 
cates that the unit is enabled. 


Spurious Vector: [Bits 7-0] 


For future compatibility, bits [3-0] 
should be 1111. 


6.9 End-of-interrupt (EOI) Register 


Before returning from the interrupt handler, software 
must issue an End-Of-Interrupt (EOI) command to 
the 82489DX Local Unit. The data written to EO! 
register is don’t care. This tells the 82489DxX to clear 
the highest priority bit in the ISR register since the 
interrupt is no longer in service. Upon EOI, 82489DX 
goes through prioritization returning to the next high- 
est priority activity. This can be a previously inter- 
rupted handler (from ISR), a pending interrupt re- 
quest (from IRR), or an interrupted task (from Task 
Priority). 


Bits [31:0] 


Figure 10. EOI Register 
Bits [31:0]: are don’t care. 
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6.10 Remote Read Register 


Since all 82489DX Local Units would typically occu- 
py the same address range, an 82489DxX local unit’s 
registers can only be accessed by the local proces- 
sor. From a system debugging point of view, this 
would mean that a large amount of state would be- 
come inaccessible if its corresponding processor 
hangs for whatever reason. To assist in the debug- 
ging of MP systems, the 82489DX support a mecha- 
nism that provides read-only access to any register 
in any other 82489DxX Icoal unit in the system. 


To read any register in a “remote” 82489DX Local 
Unit, the processor writes to the Interrupt Command 
Register specifying a Delivery Mode equal to ‘‘Re- 
mote Read’. The remote 82489DxX is specified in 
the Destination field of the Interrupt Command Reg- 
ister in the usual fashion. Debug software would 
make sure that this selects a single 82489DX only— 
for example by using the target’s 82489DX Local 
Unit ID in physical destination mode. Since no vector 
is associated with remote register access, the Vec- 
tor field in the Interrupt Command Register is used 
to select the individual remote 32-bit register to be 
read. The selector value corresponds to the address 
(offset) of the register in the local 82489DX’s ad- 
dress space. Sending a “Remote Read’ command 
results in sending a message on the ICC bus. The 
destination 82489DX responds by placing the 32-bit 
content of the selected register on the ICC bus. This 
value is read by sending the 82489DxX and place it in 
the Remote Register where software can get at it 
using regualr register access to its 82489DX Local 
Unit. The Remote Register is software read-only. 
The contents of the Remote Register is valid when 
the Delivery Status in the Interrupt Command Regis- 
ter has become “Idle” again. 


Remote Read Register 


Bits [31:0] 


Figure 11, Remote Register 


Bits [31:0] Bits [31:0] contain the contents of Re- 
mote Read Register. 


6.11 82489DX Local Configuration 


LOCAL VERSION REGISTER 


Each 82489DX Local Unit contains a hardware Ver- 
sion Register that identifies this 82489DX Local Unit 
version. This register is read only. 
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Local Version Register 


Figure 12. Local Version Register 


Version: [Bits 7-0] 


This is a version number that identifies 
this version. This field is hardwired and 
is read-only. Will be read as ‘1” for 
82489DX. 


Bits [31:8] Bits 31:8 are reserved. 


6.12 82489DX Timer Registers 


Overview 


82489DX Local Unit contains one 32-bit wide pro- 
grammable binary timer for use by the local proces- 
sor. The timer can select its clock base from one of 
three possible clock inputs. A timer mode can be 
programmed to operate in either one-shot mode or 
periodic mode. The timer can be configured to inter- 
rupt the local processor with a vector. 


Time Base 


The 82489DxX has two independent clock input pins: 


1. The CLK pin provides the clock signal that drives 
the 82489DX’s internal operation. 


2. The TMBASE pin allows an independent clock 
signal to be connected to the 82489Dx for use by 
the timer functions. 


Signals from both CLK and TMBASE can be used as 
clock inputs that feed the timer. In addition, the 
82489DxX contains a divider that can be configured 
to divide either input clock signal. The divider can be 
programmed to divide the selected input clock by 2, 
4, 8, or 16. CLK, TMBASE, and the output of the 
divider together provide three time bases: Base 0, 
Base 1, and Base 2. Base 0 is always equal to CLK: 
Base 1 is always equal to TMBASE; and base 2 is 
one of; CLK/2, CLK/4, CLK/8, CLK/16, TMBASE/2, 
TMBASE/4, TMBASE/8, or TMBASE/16. The timer 
can independently select one of these three time 
bases as its clock input as depicted in the following 
diagram. 


Base 0 


DIVIDE BY 
2, 4, 8, 16 


Base 2 


TMBASE Base 1 
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Figure 13. Time Bases 
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Figure 14. Divider Configuration Register 


Bits [31:3] Bits 31 to 3 are reserved. They should 
be written 0. 


Divider Input: 


[Bit 2] Selects whether divider’s input 
connects to the 82489DX Local Unit’s 
CLK pin or TMBASE pin. 


0: means the divider takes its input sig- 
nal from CLK, 


1: means use TMBASE. 


Divide By: [Bits 1,0] 


This field selects by how much the divid- 
er divides. 


00: divide by 2 
01: divide by 4 
10: divide by 8 
11: divide by 16 


Timer 


Software starts a timer going by programming its Ini- 
tial Count Register. The timer copies this value into 
the Current Count Register and starts counting down 
at the rate of one count for each time base pulse. 
The time is one of Base 0, Base 1, or Base 2. 


The timer has a programmable mode which can be 
One-Shot or Periodic. After the timer reaches zero in 
One-Shot mode, the timer simply stays at zero until it 
is reprogrammed. In Periodic mode, the timer auto- 
matically reloads its Current Count from the Initial 
Count and starts counting down again. 


For the timer, interrupt generation can be disabled or 
enabled, and an arbitrary interrupt vector can be 
specified. When enabled and the timer reaches 
zero, an interrupt is generated at the 82489DX Local 
Unit. Timer generated interrupts are always treated 
as edges. They can only generate maskable inter- 
rupts to the local processor. 


TIMER VECTOR TABLE 
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A timer set up with its interrupt masked is useful as a 
time base that can be sampled by the local proces- 
sor by reading the Current Count Register, for the 
purpose of measuring the intervals. By mapping the 
82489DX’s register space into a read-only user 
page, safe and efficient performance monitoring of 
user programs can be supported. 


If necessary, software may want to ensure that peri- 
odic timer interrupts on the different 82489DX Local 
Units are staggered such that the 82489DXs don’t 
all deliver their interrupt (e.g., a timer slice interrupt) 
to their local processor at the same time. This stag- 
gering avoids bursts of contention for shared re- 
sources (bus, cache lines, dispatch queue, locks). 
Randomness occurring ‘‘naturally” may be sufficient 
to ensure staggering. 


Initial Count Register 


Bits [31:0] Initial Count 


Current Count Register 


Bits [31:0] Current Count 


Figure 15. Initial Count and 
Current Count Registers 


Software writes to this register to set 
the initial count for timer. This regis- 
ter can be written at any time. When 
written, its value is copied to the Cur- 
rent Count Register and countdown 
starts or continues from there. The 
Initial Count Register is read-write by 
software. 


Current Count: This is the current count of timer. It is 
read-only by software and can be 
read at any time. 


Initial Count: 


The timer is configured via its Local Vector Table 
entry shown below (see also Interrupt Control in this 
section). 


Vector: [Bits 7-0] 


This is the 8-bit interrupt vector to be 
used when timer generates an inter- 
rupt. 


Bits [31:20] | Bits [19:18] Bits [15:13] Ea Bits [11:8] | Bits [7:0] 


Figure 16. Local Vector Table: Timer Entry 
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Bits 11-8 Reserved. Should be written 0. 


Delivery Status: [Bit 12] 


Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 


0: (Idle) means that there is currently 
no activity for this interrupt; 


1: (Send Pending) indicates that the in- 
terrupt has been injected, but its de- 
livery is temporarily held up by other 
recently injected interrupts that are 
in the process of being delivered; 
Delivery Status is software read- 
only; software writes to this field (as 


part of a 32-bit word) do not affect 


this bit. 
Bits 15-13: Reserved. Should be written 0. 


MASK: [Bit 16] 


This bit serves to mask timer interrupt 
generation. 


0: means not masked, when timer 
reaches 0, it generates an interrupt 
with vector at the 82489DX Local 
Unit 

1: means masked, and no interrupt is 
generated. 


Timer Mode: [Bits 17] 


This field indicates the operation mode 

of timer. 

0: (One-Shot): the Current Count Reg- 
ister remains at zero after the timer 
reaches zero, and software needs to 
reassign the timer’s Initial Count 
Register to rearm the timer. 

1: (Periodic): when the timer reaches 
zero, the Current Count Register is 
automatically reloaded with the val- 
ue in the Initial Count Register, and 
the timer counts down again. 

Timer Base: [Bits 19,18] 


This field selects the time base input to 
be used by timer. 


00: (Base 0) uses “CLKIN”’ as input; 
01: (Base 1) uses ‘“TMBASE”; 


10: (Base 2) uses the output of the di- 
vider (Base 2). 


Bits [31:20] Bits [81:20] are Reserved. Should be 
written 0. 
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7.0 82489DX I/O UNIT REGISTERS 


REGISTERS ADDRESSING SCHEME 


The I/O Unit indirect addressing scheme uses two 
registers directly mapped into the processor’s ad- 
dress space: the I/O Register Select register and 
the I/O Window register. The I/O register select reg- 
ister selects which |/O unit Register appears in the 
1/O Window register where it can be manipulated by 
software. 


|/O Register Select Register 


Figure 17. 1/O Register Select Register 


Bits [31:8]: Reserved. Should be written 0. 


Bits [7:0]: 1/O REGISTER SELECT: This register 
selects an 82489DX I/O unit register. 
The contents of the selected 32-bit reg- 
ister can be manipulated via the I/O 
Window Register. The I/O Register Se- 
lect register is read-write by software. 


I/O Window Register 


Figure 18. 1/O Window Register 


Bits [31:0] |/O WINDOW REGISTER: This register 
is mapped onto the I/O Unit’s register 
selected by the I/O Register Select reg- 
ister. Readability/writability by software 
is determined by the I/O unit register 
that is currently selected. 


The addresses (offsets to a platform-de- 
fined base address) of all registers are 
listed in the register summary section. 
Note that register offsets are aligned on 
128-bit boundaries; in other words, regis- 
ters are located only at every fourth 
32-bit address. This eliminates the need 
for lane-steering glue logic when con- 
necting the 82489DX’s 32-bit data bus to 
a wider (64-bit and 128-bit) bus. 
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82489DX |/O UNIT CONFIGURATION 


1/O Unit ID Register 


Each 82489DxX |/O Unit has a register that contains 
the I/O Unit’s 8-bit ID. The I/O unit ID serves as a 
physical name of the 82489DxX I/O Unit. It is used in 
arbitrating for |CC bus ownership when the I/O unit 
wants to access the ICC bus for sending any inter- 
rupt message. Unlike the local unit ID, the 1/O unit ID 
is not latched-in from the address bus during hard- 
ware reset. The I/O unit ID is set to 0 during reset. 
The software has to write different ID into the I/O 
Units before starting interrupt messages on the ICC 
bus. 


I/O Unit ID 


| Bits 31:24) | Bits (29:0) 

Bits [31:24] I/O Unit ID: 
The I/O unit ID serves as the physical 
“name” of the 82489DX unit used for 
arbitration purposes for the ICC bus us- 
age. In a system with, say, four 
82489DxX, there are 4 Local Units and 4 
I/O Units. All the 8 units should be as- 
signed different ID. The IDs should start 
with 0 and each unit should have differ- 
ent ID. 

Bits [23:0] Bits 23..0 are reserved. Should be writ- 
ten 0. 


I/O Unit Version Register 


Each 82489DX I/O Unit contains a hardware Ver- 
sion Register that identifies this 82489DX |/O unit 
version. This register is read only. 


1/O Unit Version Register 


Version: [Bits 7-0] 


This is a version number that identifies 
this version. This field is hardwired and 
is read-only. Will be read as “1” for 
82489DxX. 


Bits [15:8] Bits [15:8] are reserved. 


Redirection Table Entry 


82489DX 


Max Redir Entry: [Bits 23-16] 


This is the entry number (0 being the 
lowest entry) of the highest entry in the 
Redirection Table. It is equal to the 
number of Interrupt Input Pins minus 
one of this 1/O Unit. This field is hard- 
wired and is read-only. 


In the 82489DX I/O unit this is read as 
15. 


Bits [31:24] Bits [31:24] are reserved. 


I/O UNIT INTERRUPT SOURCE REGISTERS 


Redirection Tables 


The Redirection Table has a dedicated entry for 
each interrupt input pin. Unlike IRQ pins of the 
8259A, the notion of interrupt priority is completely 
unrelated to the position of the physical interrupt in- 
put pin on the 82489Dx. Instead, software can de- 
cide for each pin individually what it wants the vector 
(and therefore the priority) of the corresponding in- 
terrupt to be. For each individual pin, the operating 
system can also specify whether the interrupt is sig- 
naled as edges or levels, as well as the destination 
and delivery mode of the interrupt. The information 
in the Redirection Table is used to translate the in- 
terrupt manifestation on the corresponding interrupt 
pin into an inter-82489DX message. 


In order for a signal on an edge-sensitive Interrupt 
Input pin to be recognized as a valid edge ( and not 
a glitch) the input level on the pin must remain as- 
serted until the time 82489DX I/O Unit sends the 
corresponding message over the ICC bus. Only then 
will the source 82489DxX be able to recognize a new 
edge on that Interrupt Input pin. That new edge will 
only result in a new invocation of the handler if its 
acceptance by the destination 82489DX causes the 
Interrupt Request Register bit to go from 0 to 1. (In 
other words, if the interrupt wasn’t already pending 
at the destination.) 


82489DxX I/O unit has 16 Redirection Table entries. 
The layout of an entry in the Redirection Table is as 
follows: 


_aistr17) | errs | sinis | etre | atts | sine | str | aitsttoe) | aitsi7-0)_| 
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Vector (Bits [7:0] 
Interrupt vector for this interrupt 


Delivery Mode (Bits [10:8]) 
000: Fixed 
001: Lowest Priority 
010: <reserved> 
011: <reserved> 
100: NMI 
101: Reset 
110: <reserved> 
111: ExtINT 


Destination Mode (Bit 11) 
0: Physical 
1: Logical 
Delivery Status (Bit 12) 
0: Idle 
1: Send Pending 
Bit 13 Bit 13: Reserved. Should be written 0. 
Remote IRR _ (Bit 14) 
Reflects the Remote IRR bit 
0: Remote IRR bit is clear. 
1: Remote IRR bit is set. 
Trigger Mode (Bit 15) 
0: Edge 
1: Level 
Mask (Bit 16) 
0: Not Masked 
1: Masked , 
Bits [31:17] Reserved. Should be written 0. 


DESCRIPTIONS 


Vector: [Bits 7-0] 


The vector field is an 8-bit field con- 
taining the interrupt for this interrupt. 


Delivery Mode: [Bits 10-8] 


The Delivery Mode is a 3-bit field that 
specifies how the 82489Dxs listed in 
the destination field should act upon 
reception of this signal. Note that 
remote read is not supported for I/O 
device interrupts. Note that certain 
Delivery Modes will only operate as in- 
tended when used in conjuction with a 
specific Trigger Mode. These restric- 
tions are indicated for each Delivery 
Mode. 
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000: (Fixed) means deliver the signal 
on the INT pin of all processors 
listed in the destination. Trigger 


Mode for ‘‘fixed’’ Delivery Mode 
can be edge or level. 


001: (Lowest Priority) means deliver 
the signal on the INT pin of the 
processor that is executing at the 
lower priority among all the proc- 
essors listed in the specified des- 
tination; Trigger Mode for ‘‘lowest 
priority” Delivery Mode can be 
edge or level. 


100: (NMI) means deliver the signal on» 
the NMI pin of all processors list- 
ed in the destination, vector infor- 
mation is ignored. A_ Delivery 
Mode equal to “NMI” requires a 
“level” Trigger Mode. 


101: (Reset) means deliver the signal 
to all processors listed in the des- 
tination by asserting/deasserting 
the 82489DX’s Reset output pin. 
All addressed 82489DxXs’ Local 
Units will assume their reset state 
but preserve their unit ID. One 
side effect of a unit message with 
Delivery Mode equal to “Reset” 
that results in a deassert of reset 
is that all 82489DxXs’ Local Units 
(whether listed in the destination 
or not) will reset their lowest-prior- 
ity tie breaker arbitration ID to 
their unit ID (see the section on 
the ICC bus for details). A Deliv- 
ery Mode of “Reset” requires a 
“level” Trigger Mode. 


111: (ExtINT) means deliver the signal 
to the INT pin of all processors 
listed in the destination as an in- 
terrupt that originated in an exter- 
nally connected (8259A-compati- 
ble) interrupt controller. The Local 
Unit receiving this interrupt will ac- 
tivate ExtINTA in response to this 
interrupt message. A_ Delivery 
Mode of “ExtINT’ requires an 
“edge” Trigger Mode. (See the 
section on Compatibility for de- 
tails.) 
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Destination Mode [Bit 11] 


This field determines the interpretation 
of the Destination field. 


0: (Physical Mode): in Physical Mode, 
a destination 82489DX Local Unit is 
identified by its unit ID. Bits 56 
through 63 (8 MSB of the destina- 
tion field) specify the 8-bit unit ID. 


1: (Logical Mode): in Logical Mode, 
destinations are _ identified by 
matching on Logical Destination un- 
der the control of the Destination 
Format Register in each 82489DX 
Local Unit. The 32-bit Destination 
field is the logical destination. 


Delivery Status: [Bit 12] 


Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 


0: (Idle) means that there is currently 
no activity for this interrupt; 


1: (Send Pending) indicates that the 
interrupt has been injected, but its 
delivery is temporarily held up by 
other recently injected interrupts 
that are in the process of being de- 
livered; Delivery Status is software 
read-only; software writes to this 
field (as part of a 32-bit word) do not 
affect this bit. 


Bit 13: Bit 13 is Reserved. Should be written 0. 


Remote IRR: [Bit 14] 


This bit is used for level triggered inter- 
rupts; its meaning is undefined for 
edge-triggered interrupts. Remote IRR 
mirrors the interrupt’s IRR bit of the 
destination 82489DX Local Unit. When 
the value of the bit disagrees with the 
state of the Interrupt Input line, a unit 
message is automatically sent to make 
the destination’s IRR both reflect the 
new state of the Interrupt Input line, 
and then the Remote IRR bit is updat- 
ed to track its associated IRR bit. Re- 
mote IRR is software read-only; soft- 
ware writes to this bit do not affect it. 
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Trigger Mode: [Bit 15] 


The Trigger Mode field indicates the 
type of signal on the interrupt pin that 
triggers an interrupt. 


0: indicates edge sensitive, 
1: indicates level sensitive. 


Mask: [Bit 16] 


Use this bit to mask injection of this in- 
terrupt. 


0: indicates that injection of this inter- 
rupt is not masked. An edge or level 
on an interrupt pin that is not 
masked results in the delivery of the 
interrupt to the destination. 


1: indicates that injection of this inter- 
rupt is masked. Edge-sensitive in- 
terrupts signaled on a masked inter- 
rupt Input pin are simply ignored 
(i.e., it is not delivered and is not 
held pending). Level-asserts or 
deasserts occurring on a masked 
level-sensitive pin are also ignored 
and have no side effects. As ex- 
pected, changing the mask bit from 
unmasked to masked while the lev- 
el remains asserted has the side ef- 
fect of deasserting the level. It is 
software’s responsibility to deal with 
the case where the Mask bit is set 
after the interrupt message has 
been sent but before the interrupt is 
dispensed to the processor. 


Bits [31:17] Bits [81:17] are reserved. Should be 
written 0. 


Destination 


Bits [63:32] 


Destination: If the Destination Mode of this entry is 
‘Physical Mode”, then the 8 MSB [bits 
56 through 63] contain an 82489DX 
Local Unit ID. If Logical Mode, then the 
Destination field potentially defines a 
set of processors. The interpretation of 
the 32-bit destination field is further en- 
abled by the Destination Format Regis- 
ter in the 82489DX Local Units. 
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8.0 ICC BUS DEFINITION 


Physical Characteristics 


The ICC bus is a 5-wire synchronous bus connecting 
all 82489DxXs (all |/O Units and all Local Units). Four 
of these five wires are used for data transmissions 
and arbitration, and one wire is a clock. The descrip- 
tion refers to the logical state of the ICC bus. Electri- 
cal levels are just inverse of the logical state de- 
scribed. For example, the section describes that the 
ICC bus is 0000 when not transmitting any message. 
This refers to logical state. Electrically, the ICC bus 
is 1111 when not transmitting any message. 


The bus is electrically an open-drain connection pro- 
viding for both bus use arbitration and arbitration for 
lowest priority. Being open-drain, the bus is run ata 
“comfortable” speed such that design-specific ter- 
mination tuning is not required. Furthermore, each 
82489DxX receiving a message or participating in an 
arbitration must be given enough time in a single bus 
cycle to latch the bus and perform some simple logic 
operations on the latched information in order to de- 
termine whether the next drive cycle must be inhibit- 
ed. 


82489DX 


82489DX 
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Note that it is likely in MP systems that additional 
processors be-located on plug-in boards. Since the 
ICC bus would be part of the connector, the 
82489DxX to ICC bus connection is defined so that it 
can be electrically isolated using external drivers. 
The 82489DX has separate ICC bus input and out- 
put pins that can be connected externally to the 
82489DxX to either provide or not provide isolation. 


The isolation can also be used to provide a hierar- 
chical connection of ICC buses electrically support- 
ing large numbers of processors. The number of 
82489DxXs supported using the hierarchical connec- 
tion is limited only by ICC bus bandwidth. It should 
be noted that ICC bus output low current is just 


4mA. 


Bus Arbitration 


Arbitration (both for use of the bus and for determin- 
ing the lowest priority 82489DX) depends on all 
82489DX message units operating synchronously. 
To deal with the event where multiple agents start 
transmitting simultaneously, a distributed arbitration 
approach is used. Bus arbitration uses a small num- 
ber of arbitration cycles in the ICC bus. During 


82489DX 


non-lIsolated ICC BUS 


290446-10 


Figure 20. ICC Bus: Simple Direct Connection 


82489DX 82489DX 82489DX 


82489DX 


Isolated ICC BUS 


290446-11 


Figure 21. ICC Bits: Hierachical Connection 
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these cycles, arbitration losers progressively drop 
off the bus until only the winner remains transmitting. 
The winner then transmits its actual inter-unit mes- 
sage. Once the sending of a message (including bus 
arbitration) has started, any possible contender must 


suppress transmission until enough cycles have 
elapsed for the message to be fully sent. The num- 


ber of message cycles depends on the type of mes-. 


sage being sent. 


A bus arbitration cycle starts by the agent driving its 
unit ID on the ICC bus. High-order ID bits are driven 
first, successive cycles proceeding to the low bits of 
the ID. All losers in a given cycle drop off the bus, 
using every subsequent cycle as a tie breaker for the 
previous cycle. By the time all arbitration cycles are 
completed, there will be only a single agent left driv- 
ing the bus. 


The 8-bit unit ID (17 16 15 14 13 12 11 10) is chopped up 
in successive groups of 2 bits (I7 16)(I5 14)(I3 12) 
(11 10). Each of these tuples is first decoded before 
driving them on the bus. The Os and 1 indicate logi- 
cal levels and not signal levels. The ICC bus is 0000 
when not transmitting any message. The decoding 
used is: 


Note that the pattern generated on the ICC bus by 
tuple (I3 12) will be represented as i32 i32 i32 i32. 
The lower case signifies this encoding. 


Each tuple of the ID only contributes to a single wire, 
making it possible for an agent to determine with 
certainty whether to “drop off’ or to continue arbi- 
trating in the next cycle for the following two bits of 
the unit ID simply by checking whether the bus line 
the agent is driving is also the highest order 1 on the 
bus. Each ICC bus cycle therefore arbitrates 2 bits. 


1: W6 176 i76 i76 ICC bus arbitration 
2: 154 i154 i54 i54 

3: i382 i382 i32 i32 

4: 


i110 i10 i10 110 
<message body > 
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Lowest-Priority Arbitration 


Arbitration is also used to find the 82489DX Local 
Unit with the lowest processor priority. Lowest-priori- 
ty arbitration uses the value of the 82489DX’s Proc- 
essor Priority value appended with an 8-bit Arbitra- 
tion ID (Arb ID) to break ties in case there are multi- 
ple units executing at the lowest priority. 


Using the constant 8-bit unit ID as the Arb ID has a 
tendency to skew symmetry since it would favor 
82489DXs with low ID values. An 82489DX Local 
Unit’s Arb ID is therefore not the unit ID itself but is 
derived from it. At reset, an 82489DX Local Unit’s 
Arb ID is equal to its unit ID. Each time a message is 
broadcast over the ICC bus in lowest priority mode, 
all 82489DX Local Units increment their Arb ID by 
one, which gives them a different Arb ID value for 
the next arbitration. The Arb ID is then endian-re- 
versed (LSB becomes MSB, etc.) to ensure better 
rotation of which 82489DX gets to have the lowest 
Arb ID next time around. The reversed Arb ID is then 
decoded to generate arbitration signals on the ICC 
bus as described above. 


To support hot insertion of processor boards in a 
running MP system, a mechanism is provided to al- 
low the 82489DX of the added processor to syn- 
chronize its Arb ID with the existing 82489DxXs. This 
is accomplished by broadcasting a message with 
Delivery Mode equal to ‘‘Reset’’, Trigger Mode equal 
to “Level”, and Level equal to 0. This message 
must be broadcast before the newly added 
82489DxX is allowed to participate in a lowest-priority 
arbitration. Depending on the exact sequence under 
which the newly inserted board is powered-up and 
initialized, this Arb ID synchronization may occur nat- 
urally if a Reset-deassert to the new 82489Dx is part 
of that sequence. If not, the local processor can al- 
ways send this as an inter processor interrupt (with a 
null destination), causing only Ihe side effect of re- 
setting all 82489DX Arb IDs. 


ICC BUS MESSAGE FORMATS 


The short message format is described first. Note 
that the first 19 cycles of both short and long mes- 
sage formats have the same interpretation. 

1: i76 i76 i76 i76 ICC bus arbitration 
2: 154 154 154 i54 
3: 82 i382. 82 82 
UNOS AO 1D AG 
5: DM M2 M1 MO destination mode and 
delivery mode 
control bits 


vector 


6: 778 “a” L ™ 
Fe VG. Vo Va 
SNe: ¥A0 Vi VO 
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destination 


checksum for 
cycle 5 through 
16 
post amble 
accept (1000 if 
OK, 1110 if 
preempt, else 

| error) 
EU... Ot » a... (a ® Oy ile 4 
219 Oe OF 5 TO a eae 


18: “~ “an ai. pote 
19: A A A A 


Cycles 1 through 4 are bus arbitration as described 
earlier. Cycle 5 (DM M2 M1 MO) is the Destination 
Mode which is 0 for Physical mode and 1 for Logical 
Mode, and the Delivery Mode of the message. The 
encoding used for the Delivery Mode in the message 
is identical to the encoding used for the Delivery 
Mode in the Redirection Table, Local Vector Table, 
and Interrupt Command Register. 


[wo | Delivery Node | 
Pree | rier 
To | <reservee> | 
: 
aa 
rae 


ExtINT 


Cycle 6 contains the Control Bits of the message. 
The control bits are: 


e TM (Trigger Mode): indicates whether this mes- 
sage corresponds to an edge or level; 


e L (Level): indicates whether this is an Assert or a 
Deassert of a “‘level’’ signal. L is undefined when 
TM is edge. 


6: “0” “0” L TM Control Bits 
TM = Trigger Mode (0 = edge, 1 = level) 
L = Level (0 = deassert, 1 = assert) 


The length of the message is derived from the Deliv- 
ery Mode, the Control Bits, and the Accept cycle of 
the message. 


TM/L (AAAA) 


Reset 
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Cycles 7 and 8 are the 8-bit interrupt vector. The 
vector is only defined for Delivery Modes Fixed, and 
Lowest-priority. For Delivery Mode of ‘Remote 


Read’, the vector field contains the address of the 
register to be read remotely. 


lf DM is 0 (physical mode), then cycles 9 and 10 are 
the unit ID and cycles 11 through 16 are zero. If DM 
is 1 (logical mode), then cycles 9 through 16 are the 
32-bit Destination field. The interpretation of the logi- 
cal mode 32-bit Destination field is performed by the 
Local Units using the Destination Format Register. 
The sending 82489DX knows whether it should (incl) 
or should not (excl) respond to its own message. 


Cycle 17 is a checksum over the data in cycles 5 
through 16. The checksum is computed by adding all 
4-bit quantities of cycles 5 through 16, feeding carry 
out of the MSB back into the LSB. This protects the 
data in these cycles against transmission errors. The 
(single) 82489DX driving the message provides this 
checksum in cycle 17. 


Cycle 18 is a post amble cycle driven as 1111 by the 
sending 82489DxX allowing all 82489DXs to perform 
various internal computations based on the informa- 
tion contained in the received message. One of the 
computations takes the computed checksum of the 
data received in cycles 5 through 16 and compares 
it against the value in cycle 17. If any 82489DX com- 
putes different checksum than the one passed in 
cycle 17, then that 82489DX will signal an error on 
the ICC bus in cycle 19 by driving it as 1111. If this 
happens, all 82489DXs will assume the message 
was never sent and the sender must try sending the 
message again, which includes re-arbitrating for the 
ICC bus. In lowest priority delivery when the interrupt 
has a focus processor, the focus 82489DX will sig- 
nal this by driving 1110 during cycle 19. This tells all 
the other 82489DxXs that the interrupt has been ac- 
cepted, the 82489Dxs is preempted, and short mes- 
sage format is used. All (non-focus) 82489DXs will 
drive 1000 in cycle 19. Under lowest priority mode, 
1000 implies that the interrupt currently has no focus 
processor and that priority arbitration is required to 
complete the delivery. In that case, long message 
format is used. If cycle 19 is 1000 for non Lowest 
Priority mode, then the message has been accepted 
and is considered sent. 


\ 


1S:EEEE 
1000 OK 
1110 preempt 


<others> error (drive error as 1111) 
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When an 82489DxX detects and reports an error dur- 
ing the error cycle, that 82489DxX will simply listen to 
the bus until it encounters two consecutive idle 
(0000) cycles. These two idle cycles indicate that 
the message has passed and a new message may 
be started by anyone. This allows an 82489DxX that 
got itself out of cycle on the ICC bus to get back in 
sync with the other 82489DxXs. 


Long Message Format 


Cycles 1 through 19 of the long message format are 
identical to cycles 1 through 19 of the short mes- 
sage format. As mentioned, long message format is 
used in two cases: 


(1) Lowest Priority delivery when the interrupt does 
not have a focus. Cycles 20 through 27 are eight 
arbitration cycles where the _ destination 
82489DXs determine the one 82489DX with 
lowest processor priority/ARB ID value. 


Remote Read messages. Cycles 20 through 27 
are the 32-bit content of the remotely read regis- 
ter. This information is driven on the bus by the 
remote 82489DX. 


(2 


— 


Cycle 28 is an Accept cycle. In lowest priority deliv- 
ery, all 82489DxXs that did not win the arbitration (in- 
cluding those that did not participate in the arbitra- 
tion) drive cycle 28 with 1000 (co accept), while the 
winner 82489DX drives 1111. If cycle 28 reads 1111, 
then all 82489DXs know that the interrupt has been 
accepted and the message is considered delivered. 
If cycle 28 reads 1100 (or anything but 1111 for that 
matter), then all 82489DXs assume the message 
was unaccepted or an error occurred during arbitra- 
tion. The message is considered undelivered, and 
the sending 82489DxX will try delivering the message 
again. 


For Remote Read messages, cycle 28 is driven as 
1100 by all 8 2489DXs except the responding re- 
mote 82489DxX, who drives the bus as 1111 in case 
it was able to successfully supply the requested data 
in cycles 20 through 27. If cycle 28 reads 1111 the 
data in cycles 20 through 27 is considered valid; oth- 
erwise, the data is considered invalid. The source 
82489DX that issued the Remote Read uses cycle 
28 to determine the state of the Remote Read 
Status field in the Interrupt Command Register (valid 
or invalid). In any case, a Remote Read request is 
always successful (although the data may be valid or 
invalid) in that a Remote Read is never retried. The 
reason for this is that Remote Read is a debug fea- 
ture, and a “hung” remote 82489DxX that is unable - 
to respond should not cause the debugger to hang. 
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Cycles 29 and 30 are two idle cycles. The ICC bus is 
available for sending the next message at cycle 31. 
The two idle cycles at the end of both short and long 
messages, together with non zero (i.e., non idle) en- 
coding for certain other bus cycles allow an ICC bus 
agent that happens to be out of phase by one cycle 
to sync back up in one message simply by waiting 
for two consecutive idle cycles after reporting its 
checksum error. This makes use of the fact that val- 
id arbitration cycles are never 0000. 


i76 i76 i76 i76 
54 i54 i54 i54 
is2 .i@2 632 2 
i10 i10 i10 i10 
M2 M1 MO delivery mode 
“Y “O° kL - TM @ontol bits 
V7 V6 V5 V4 vector 

Va (2 »¥T vo 

: D30 D29 D28 destination 
10: D27° D026. D25. O24 

11:-. 023° D022 -D21.. 020 

12: D19 D18 D117 D116 


ICC bus arbitration 


Payee Fears 
O 
= 


ce) 
O 
a) 
eat. 


checksum for cycles 5 

through 16 

1 te ha a. St: postampie 

accept (1000 if OK, 

1110 if preempt, 

else error) 

20: p76 p76 p76 p76 lowest priority 

arbitration or 32 bits 

| of remote register 

21: p54 p54 p54 p54 processor priority 

22: p32 p32 p32 p32 

23: p10 pi0 p10 pi0 

24: a76 a76 a76 a76 

25: a54 a54 a54 a54 arbitration ID 

26: a32 a32 a32 a32 

27: aiO aiO aiO at10 

28: A A A. A_ accept 

2a: OO" “0 . “ON. ““O” 11a 

a0; “Oo” “Oo “Oo” ~-“O” .idis2 


9.0 HARDWARE TIMINGS 


This section covers the following: 
— Timing Diagram Notation 
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— 82489DX Register Access Timing Diagrams with 
Descriptions 


A block diagram of the configuration of the CPU 
module of a MP system is shown. This in no way is 
intended to be a complete representation of 
486/intel Cache/Intel Cache Controller connec- 
tions. It is intended to show all the 82489DX connec- 
tions, and how they connect to other components 
on and off the module. This module has arbitrarily 
been drawn with a 64-bit data bus to show how the 
expanded address space architecture fits. The unit 
can be similarly attached to either a 32-bit or 128-bit 
data bus, with total transparency to shrink-wrap soft- 
ware. 


In this configuration, the 82489DX uses the same 
clock source as the processor and cache. However, 
it is quite possible to consider 82489DX as a memo- 
ry bus device and hence supply 82489DX with the 
memory bus clock, which can be slower than the 
CPU module clock frequency. 


In the configuration shown, the processor’s INT and 
NMI pins could be supplied by other source to allow 
for the possibility that 82489DX can be totally by- 
passed if desired, by allowing those signals to be 
driven from off the module while the 82489Dx is dis- 
abled. The reset signal generated by the 82489DX 
goes to the MBC (memory bus controller) which is 
required to drive configuration lines at reset time. 
This would probably be configured as a “warm’’ re- 
set by the MBC. 


A future version of cache controller may generate 
the chip select for 82489DxX at a fixed memory loca- 
tion of hexFEE00000. By having the cache controller 
to provide the chip select signal, it would encourage 
a standard mapping for 82489DX address space. In 
some MBC designs, this signal should be connected 
to the MBC since 82489DxX cycles limit bus pipelin- 
ing by constraining how soon the next bus cycle can 
come. The 82489DX chip select can be generated 
by the MBC completely. 


The address, data and most of the bus control sig- 
nals share the respective bus with cache and cache 
controller. The block diagram shows attachment for 
only 6 address lines: A4—A9. A10 should be 0. This 
is all the 82489DX needs for operation, however, if 
the address lines are used to initialize 82489DxX lo- 
cal ID at reset time, 8 address lines are required, 
A3-A10. 
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INTERFACING TO THE ICC BUS 


The 82489DX has separate ICC bus input and out- 
put pins to facilitate using external drivers. The ICC 
bus input pins (MBIO-3) are TTL-level compatible 
CMOS inputs. The output pins (MBOO-3) are open- 
drain pins which required external pull-ups. The 
open-drain output buffers are small buffers with: 


Sink current of < 4 mA. Special consideration 
must be exercised when driving large capaci- 
tive loads or long transmission lines. The pull- 
up resistor and the capacitive load constitute 
RC time constant that will affect the output 
transition times. This in turn will limit the oper- 
ating frequency of the ICC bus. 


When designing in the ICC bus, one needs to 
consider the loads that each 82489DxX will be 
driving and whether external drivers should 
be used. In most situations, the ICC bus driv- 
en high (MBO pins pulled high by the external 
pull-up resistors) poses the most challenge. 
Simulating the target design on an electrical 
simulator (such as SPICE) will help greatly. 
as shown in the following examples. 


First Order Buffer Models 


Figure 21a and 21b are first order input buffer and 
output buffer models of the MBI and MBO pins. The 
open-drain of the MBO is modeled as a switch as 
the primary interest here is the MBO pins going high. 
These models can be used on SPICE simulations to 


82489DX 


obtain first order behaviors. The parameters for 
these models are as follows: 


Cp (package capacitance) = 3 pF 

Lp (package inductance) = 15 nH 

Rb (bond wire resistance) = 0.089 

Ci (input buffer capacitance) = 3 pF 

Co (output buffer capacitance) = 6 pF 

Ro (output buffer impedance) = 309-800 


MBO Pull-up Resistor 


To minimize the RC time constant, one would like to 
use the smallest pull-up resistor value possible. The 
MBO pins has a worst case lol-spec of 4 mA and 
Voc = 4.75V. This translates to a minimum pull-up 
of about 1 KM. Where stronger drive is needed 
(smaller pull-up resistors), external drivers must be 
used. | 


Driving Lumped Capacitance 


In systems where external drivers are not used, the 
MBI pins will be tied to the MBO pins. Figure 21disa 
SPICE simulation of the MBO output with a 1 KO 
pull-up driving lumped capacitive loads from 10 pF 
to 150 pF. 


At a load of 50 pF, it takes about 30 ns to charge up 
to 2V. At 100 pF, it takes an additional 25 ns. Figure 
21d can be used to estimate the loading delay at 
different lumped capacitive loads. 
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Figure 21b. First Order Open-Drain Output Buffer for MBO Pins 


ADVANCE INFORMATION 


2-93 


‘ —_," i ¥ , ‘ o hg \ : Ls 4 p ote Poa vs ws aa . % ? . * 
ae a ee OE DL RA ARR ag Gh ae a le 
‘ a te Be wy 13% i. j ei F ; *~ - Bee . iu ‘ ’ 


~ * 


’ a 

82489DX : intel : 
In real systems, the loads are made up of lumped The ICC bus is shared by two 82489DxXs, one at 
capacitance and transmission lines. More accurate each end. The ICC bus is modeled as a transmission 
results can be obtained using transmission line mod- line. For the simulation, only one of the 82489DxX is 
els. driving. A pull-up resistor of 2 KO is used at each 
end (1 Kf. equivalent value) as shown in Figure 21e. 
aa 9 va ; Figure 21f shows the signals at each end of the 12 
Driving Transmission Lines inch transmission line. Trace 1 is the wave form at 
the driven end and trace 2 is the signal at the receiv- 
ing end of the line. The 2 ns delay between the two 
signals is the propagation delay (or flight time) 
through the 12 inch transmission line. It takes about 

35 ns for the voltage to charge up to 2V. 


Two device model 


In this example the ICC bus is a signal line on an 
FR-4 printed circuit board. The line width is 6 mils. 
Line length of 12 inches and 18 inches are modeled. 


WAGER Fee Phe DGGrGs Ra: Bam SOUS OnE SARA VIE Figure 21g shows the received signal with different 


ue ee line length and with additional lumped capacitance. 
resistivity = 0.6 m/sq. (0.1/inch for 6 mil Trace 1 is for 12 inch only. Trace 2 is for 12 inch with 
width) additional 20 pF lumped capacitance to represent’ 

inductance = 60 pH/sq. (10 nH/inch for 6 mil interconnect socket capacitance. Trace 3 is for 18 
width) inch plus 20 pF. The presence of the 20 pF at each 


end of the 12-inch transmission line increases the 


Capacitance = 0.55 nF/sq. in. (3.3 pF/inch for 6 delay time by 20 ns at 2V. 


mil width) 


+5V 
R 


puctup = 1 Ko 
MBO i 
MBI C 
| D 
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Figure 21c. ICC Bus Driving Lumped Capacitance 


100.0ns 150.0ns 
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Figure 21d. 1 KQ. Pull-Up Driving Lumped Capacitance 
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Four Device Model 


In this example (Figure 21h), the ICC bus is a 
12-inch transmission line with four 82489DXs con- 
nected at 4 inch intervals. The loading at each junc- 
tion consisted of the MBI and MBO buffers and a 
20 pF lumped capacitance. 2 K. pull-ups are at 
each end of the transmission line. 


As shown in Figure 21i, it takes more than 90 ns for 
the signal level at both ends to reach 2V. 


82489DX 
MBO 


82489DX 


One way to improve the low to high transition time is 
to use a stronger pull-up (smaller resistor value) 
which is possible using external line drivers with their 
larger current drive capabilities. 


Figure 21j shows the difference in output when the 


model is used with 3002 pull-ups at each end of the 
transmission line. 
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Figure 21f. Waveform at Both Ends of 12” T-line 
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External Drivers/Buffered ICC bus 


The 82489DX has separate ICC Bus input (MBI) and 
output (MBO) pins that can be connected to external 
line drivers in systems that has appreciable loading 
on the ICC Bus or where modularity of the bus is 
needed. 


Figure 21k is a typical implementation using external 
drivers with tri-state outputs. Drivers such as 74F125 
or its equivalent can be used. The drivers should be 
placed as close to the MBO pins as possible. The 
input buffer on MBI is optional depending on the us- 


intel. 


ers ICC Bus scheme. The total delay through the 
drivers, buffers, transmission line, clock skews etc. 
must be calculated to ensure that all the ICC bus 
timing requirements are met. 


A hierarchical bus connection can also be used in 
applications that cannot afford driver/buffer per unit 
and where bus loading are localized in cluster 
groups. Figure 211 shows such a connection where 
each cluster group is connected directly and drivers 
are used to connect to other clusters. Each cluster 
group is assumed to be close together physically 
with small loading on the local ICC bus. 


100.0ns 


150.0 ns 
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Figure 21h. Four 82489DX Configuration 
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3009, each end 


100.0ns 1:50: ate 
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Figure 21j. Waveform with Different Pull-Ups 
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Transmission Line Termination 


As with all high speed designs, one has to consider 
transmission line effects on signals, especially clock 
signals. Even though the ICC bus clock, ICLK is usu- 
ally operated in the 10 MHz range, one has to con- 
sider proper transmission line termination also for 
short rise times. Figure 21m shows the ICLK wave 
form at the end of a 12 inch T-line when driven by a 
clock generator with and without series matched ter- 
mination. 


Series termination should not be used for the ICC 
bus data lines (MBO). The combination of the pull-up 
resistor and series resistor would degrade the output 
low voltage, Vol. For example, with a pull-up of 3000 
and a series termination of 509 at each end, the Vol 
voltage at the receiving end would be at 1.55V if the 
driving end is at 0.4V (see Figure 21n). 


ICC BUS Operating Frequency 


The 82489DX ICC BUS has a design target of oper- 
ating up to 16 MHz (62 ns period). As shown in the 
examples above, the MBO low-to-high transition 
times are strongly dictated by the loads and the pull- 
ups used. This will in turn affect the maximum oper- 
ating frequency of ICLK. 


In general, the minimum period is the larger of 62 ns 
or MBO-to-MBI low data time or MBO-to-MBI high 
data time. 


MBO-to-MBI low time = 


(ICLK skew + MBO valid low delay + T-line 
prop.delay + ext. buffer delay + MBI setup 


time) 
*5V | 82489Dx 


1K 


Note: 
Input buffer is optional 


intel. 
MBO-to-MBI high time = 


(ICLK skew + MBO Hi-Z delay + pull-high 
time + T-line prop.delay + ext. buffer delay 
+ MBI setup time) 


Maximum MBO valid low delay = 50 ns 
Maximum MBO H-Z delay = 15 ns 
MBI minimum setup time = 8 ns 


In the example shown earlier where two 82489DXs 
are at each end of a 12-inch T-line with no other 
loads, the pull-high time to 2V is 35 ns (trace 1 in 
Figure 21g). If the ICLK skew is 2 ns, then this con- 
figuration can operate to 62 ns period or 16 MHz. 


If the same configuration has additional 20 pF loads 
at each end, then the pull-high time is 55 ns (trace 2 
in Figure 21g). The maximum frequency decreases 
to 12 MHz (82 ns period). 


In the four device model discussed earlier, where 
the ICC Bus is unbuffered, the pull-high time is 90 ns 
(Figure 21j). The operating frequency will be less 
than 8 MHz (117 ns period). If external buffers are 
used (whereby allowing use of 3002 pull-up) and 
assuming the external buffers have delays of 10 ns, 
the operating frequency is limited by the MBO-to- 
MBI low time of 72 ns or 14 MHz. 


NOTE: 
Each application is unique in its configuration and 
loading on the ICC Bus. The above examples high- 
lighted some of the factors that need to be consid- 
ered. It is important to do electrical simulation to 
ascertain if the propose implementation is viable 
before committing to the printed circuit board. 


*5V | g2489Dx 
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Figure 21k. External Driver/Buffer Implementation 
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Figure 21n. Effect of Series Termination on MBO VOL 
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Figure 22. Possible Configuration of the CPU Module 


9.1 82489DX Register Access Timing 


This section provides descriptions of the four basic 
cycle timings for the 82489DX, which are: 


— Register Write 

— Register Read 

— Interrupt Acknowledge 
— Reset 


In addition, Inter-Unit (nibble) bus timings are pre- 
sented. 


Register access occurs in three distinct phases: 
1. Control Phase, 

2. Address Phase and 

3. Data Phase. 


They always occur in this order, although in some 


cases address and data phases can occur in the 
‘same clock cycle, as will be seen in the diagrams. 
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NOTE: 
As mentioned previously, the clock signal in all 
these timing diagrams is assumed to be the proces- 
sor clock. 


TIMING DIAGRAM NOTATION 


The 82489DX bus (register access) interface is syn- 
chronous. Unless otherwise noted, setup and hold 
times for inputs, and delay times for outputs are 
measured with respect to a rising clock edge. There- 
fore the timing diagrams contain very few construc- 
tion lines to identify timing parameters and ones that 
exist are explicitly discussed. High and low level for 
signals are obvious; dashed lines at the middle of 
the signal range indicate a tristate output buffer in 
high impedance mode; hashed or cross-hatched 
lines indicate a “don’t care’’ state for inputs. 


Cycle expansion marks—short curved lines breaking 
the waveform—appear throughout. These appear in 
sets or groups which are identified by construction 
arrows and/or vertical column alignment. Conditions 
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resulting in cycle expansion are listed at the bottom 
of each diagram, and the associated set of expan- 
sion marks indicates which signals must be 
stretched for that condition. Signals not so indicated 
are not affected by that condition, and continue with- 
out any cycle stretching. For example, the data 
phase of a transaction may be delayed, stretching all 
data related signals, while address related signals 
can continue to the next cycle. 


Sample points for input signals are marked with 
bold, downward pointing arrows. Since sample tim- 
ing for address, data and cycle definition signals is 
dependent on the timing of related control signals, a 
bar is used on top of the arrow to indicate the signal 
with the independent timing. Each signal group has 
exactly one independent signal. In general, indepen- 
dent signals are sampled on every clock, and there- 
fore must meet setup and hold times on every clock 
edge. Signals having dependent timing, (indicated 
by the arrow with no bar), are only sampled when 
the associated independent signal is active, and 
therefore setup and hold times for dependent sig- 
nals need only be met at the indicated sample 
points. 


REGISTER WRITE TIMING 


For discussion of this bus cycle, refer to Figure 23, 
82489DX Timing Diagram 1. This shows the relation- 
ship between the three phases of the bus cycle. 


The contro! phase is independently timed by the 
ADS signal. The cycle definition signals [M/IO, D/C, 
are dependently sampled with ADS as indicated by 
the bold sample point arrows labeled “C’’. The cycle 
definition signals will be sampled in the first clock 
when ADS is active (low). The control signal should 
remain stable until ADS goes inactive. For any valid 
82489DX cycle, the memory bus controller should 
ensure that the ADS pulse for a subsequent bus cy- 
cle is NOT presented until after the 82489DX as- 
serts its RDY pin (low) as shown. 


The address phase is independently timed by the 
(BGT) signal, as indicated by the bold sample point 
arrows labeled “A”. This signal is actually used an 
address latch enable, however, its name is intended 
to imply that in most cases it can be directly driven 
by the Intel cache controller signal of the same 
name. The BGT pulse may be delayed until the ad- 
dress bus is available, in which case all address and 
data phase signals will be stretched. Note that DLE 
must not occur before BGT. 82489DX does not start 
the internal cycle until BGT is recognized with the 
appropriate chip select signal, If multiple ADS has 
been issued without BGT and a valid chip se- 
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lect, the internal cycle starts with the most recent 
ADS cycle definition preceding the BGT with valid 
chip select. 


NOTE C: 
Address information, including chip select (CS), is 
sampled in the first clock when BGT is active (low), 
and they must remain stable until BGT goes inac- 
tive, OR until RDY is asserted (low), whichever oc- 
curs first. CS should be stable when ADS is active. 


In some configurations, BGT may not be provided, 
and can be permanently tied low. In this case, the 
independent address timing will occur exactly one 
clock after the ADS signal is first sampled low, and 
the dependent address information (address and 
chip select) will be assumed stable at this time. 
82489DX recognize that independent BGT timing is 
not provided by sampling a low state of BGT at the 
time ADS is first sampled low. 


The data phase is independently timed by the DLE 
signal, as indicated by the bold sample point arrows 
labeled “D’’. In the case of register writes, this signal 
work logically like a synchronous data latch enable. 
The DLE pulse may be delayed until the data bus is 
available, in which case data and RDY will be 
stretched. 


NOTE D: feat. 
Write data are sampled the first clock when DLE is 
active (low), and should remain stable until DLE 
goes inactive, OR until RDY is asserted (low), 
whichever comes first. 


In some configurations, DLE may not be provided, 
and can be permanently tied low. In this case, the 
independent data timing will occur exactly one clock 
after the ADS signal is first sampled low, OR on the 
same clock as BGT is first sampled low, whichever 
occurs later. The data bus will be assumed stable at 
this time. 82489DX recognize that independent DLE 
timing is not provided by sampling a low state of DLE 
at the time ADS is first sampled low. 


Cycle completion is signaled by the RDY signal. Its 
relative positioning on any of the timing diagrams 
does NOT imply the number of clock cycles required 
for an access. RDY is delayed as needed in order for 
the 82489DxX to complete the cycle. It is then assert- 
ed (low) for one clock cycle and then deasserted. 
Again, the next ADS cannot start until after RDY has - 
been driven low. ADS must return to an inactive high 
state before the next cycle can be issued. It is highly 
recommended not to have, ADS more than one 
clock wide. 
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REGISTER READ TIMING 


For discussion of this bus cycle, refer to Figure 24, 
82489DxX timing Diagram 2. It shows the relationship 
of the three phases of the bus cycle, however, the 
dependent control and address signals are not 
shown here, since they behave exactly as in the 
case of a register write. See the previous section for 
the description of control and address phases of the 
bus cycle. 


In the case of aread when DLE is used (Figure 244A), 
it works logically like an asynchronous, output data 
enable. The 82489DX drives the data bus within 
time delay “A” after DLE is asserted, which must not 
occur before BGT. Note that even though the bus is 
being driven, the data only becomes valid during the 
clock cycle in which RDY is asserted, after that 
point, valid data is maintained on the bus as long as 
DLE remains asserted, after which the data bus re- 
turns to high impedance state within time delay “B”’ 
of DLE deassertion. If DLE is asserted late, 
82489DX could complete its internal read cycle and 
return RDY early. In that case, RDY active low state 
will be maintained until DLE is asserted. 


In the case of a read when DLE is NOT used (Figure 
24B) the data is driven for exactly one clock cycle, 
coincident with RDY being asserted. 


DLE is sampled with the control signals to determine 
whether it is being used. If sampled in the asserted 
state when ADS is active, the DLE will be consid- 
ered not used, and its state during the remainder of 
the cycle doesn’t matter. This is consistent with the 
notion of permanently tying this signal low when not 
used, as described in the previous section. 


Indication of the end of the bus cycle is dependent 
on the use of DLE. When it is not used, (i.e., perma- 
nently tied to ground) RDY indicates the end of the 
bus cycle, as it does in the case of write access. 
When it is used, DLE deassertion indicates the end 
of the cycle, since the 82489DX could be driving the 
bus well after RDY is deasserted. In either case, the 
82489DX can accept the next ADS pulse anytime 
after RDY has been asserted. However, note that if 
DLE is being used, the next ADS should be delayed 
until DLE can be safely sampled inactive. Note these 
two options for earliest next cycle in the timing dia- 
gram. 


INTERRUPT ACKNOWLEDGE TIMING 


For discussion of this bus cycle, refer to Figure 25 
82489DxX timing diagram 3, This cycle is the result of 
the 82489DX posting an interrupt to the processor 
by asserting PINT. After PINT is asserted, other bus 
cycles may occur before the interrupt acknowledge 
cycle. 
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PINT can be asserted for any external (8259) inter- 
rupt as shown in Figure 25A, or for an 82489DX gen- 
erated interrupt as shown in Figure 25B. ExtINTA 
indicates whether PINT was asserted in response to 
an 8259 request or an 82489DX request. This signal 
is used by external control logic to either allow or 
preciude the 8259 from responding to the subse- 
quent interrupt acknowledge cycle. It should be not- 
ed that ExtINTA pin gets deactivated at third clock 
after the ADS of the second INTA cycle. 


When ExtINTA is high, the 82489DxX will not respond 
to the acknowledge cycle other than to deassert 
PINT signal, and clear the pending external interrupt 
two full clock cycles after the ADS of the second 
INTA cycle, as shown in Figure 25A. Once PINT is 
asserted in response to an external interrupt, it can 
only be deasserted by an INTA cycle. An INTA cycle 
is recognized by 82489DX as soon as the bus cycle 
definition is sampled with ADS in a low state. BGT is 
not needed in this case. 


When ExtINTA is low, the 82489DxX will respond to 
the acknowledge cycle, as shown in Figure 25B. In 
this case, external logic (e.g., the memory bus con- 
troller) is expected to prevent any attached 8259 
from seeing the acknowledge cycle. When PINT is 
asserted in response to an 82489DxX internal inter- 
rupt, it can be deasserted by an INTA cycle. The 
PINT signal will be deasserted 5 full clocks after 
BGT of the second INTA cycle. 


Note that ExtINTA is stable at all times while PINT is 
asserted. That means that even if new interrupts ar- 
rive between the time an interrupt is posted to the 
processor, and the acknowledge occurs, the 
82489DX will not change its commitment for an ex- 
ternal (8259) or internal (82489DX) acknowledge cy- 
cle, regardless of priority. This also means that PINT 
may be raised for a high priority internal interrupt 
right after responding to the external Interrupt. In any 
event, PINT will be kept low for a minimum of two 
clocks before reasserting itself. 


The interrupt acknowledge cycle is indicated by the 
bus cycle definition signals all being low, and looks 
like two consecutive read cycles, except that there is 
no explicit address information. The actual content 
of the address pins during this cycle is processor 
dependent, and therefore there is no chip select ei- 
ther. Chip select is implied by a combination of the 
bus cycle definition signals (all low) and BGT. | 


Note that there is a “dummy” data phase in the first 
interrupt acknowledge cycle. This allows parity to be 
generated on the bus for processors like i860XP. 
During this cycle, 82489DX drives random data on 
the bus with the appropriate parity. The interrupt Re- 
quest Register is ‘frozen’ and the highest priority 
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pending interrupt vector is returned to the processor 
in the second acknowledge cycle. 


The second acknowledge cycle has a complete data 
phase with timings identical to those of an ordinary 
register read. The data returned is the vector of the 
highest priority internally pending 82489Dx interrupt, 
or the spurious interrupt vector, if there is no inter- 
rupt pending higher than the current processor prior- 
ity. 


Note that the timing diagram shows DLE being used 
(sampled high during ADS). However, just as in a 


normal read cycle, the option exists not to use DL 
(i.e., permanently tied to ground). 


RESET AND MISCELLANEOUS TIMING 


For discussion of this bus cycle, refer to Figure 26 
82489DX Timing Diagram 4. It shows the 82489DX 
reset cycle, the timing of some related signals, and 
the ICC bus. The RESET input has a setup and hold 
time to the system clock edge, CLKIN, as do other 
independently timed signals, The RESET signal will 
reset the two asynchronous system on the chip, 
namely the ICC bus unit running synchronously to 
the ICLK and all the other unit running synchronous 
to the system clock, CLKIN. RESET must meet the 
minimum reset time with respect to both clocks and 
there should be at least one ICLK rising edge during 
reset. The TAP controller should also be initialized. 


During reset, an eight-bit 82489DX Local Unit ID can 
be optionally initialized. Eight address lines, A10-—A3 
.are sampled on every clock edge while RESET is 
asserted. The last sample remains in the 82489DX 
Local Unit ID register after reset. Alternatively, the 
82489DX Local Unit ID can be loaded with a register 
write as part of software initialization, before 
82489DxX operation is started. In any event, the reg- 
ister must be initialized before the 82489DX can 
communicate on the ICC bus, including sending/re- 
ceiving RESET messages. All valid signal to 
82489DX should wait at least two full clocks after 
RESET is deasserted. 


The PRST signal (reset output) is asserted both with 
RESET input, or under software control. Its on-off 
delay times are relative to the rising clock edge. The 
duration of PRST under software control is defined 
by the software itself. Also note that the PNMI pin 
has the same timing as does PRST when the latter 
is software controlled. 


The ICC bus signals are both input and output on 
each cycle. Setup, hold and delay times are all mea- 
sured with respect to the ICC bus Clock ICLK which 
has no relationship to the Processor clock on which 
the remainder of the 82489DX runs. This means 
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that the ICC bus is independently sampled on each 
ICLK edge, as shown. It also implies that largest 
possible hold time will not exceed the minimum de- 
lay time. 


After reset, all 82489DX registers are reset to ‘‘0” 
state. The mask bits in the local vector table and the 
redirection table are reset to “1” state to mask out 
all interrupts. All reserved bits are all wired to “0” 
state permanently on chip. 


10.0 BOUNDARY SCAN 
DESCRIPTION 


The 82489DX is equipped with the JTAG boundary 
scan standard. This feature allows the user to test 
the interconnections between 82489DX and the ex- 
ternal hardware once they have been assembled 
onto a printed circuit board or other substrate. In 
addition to the JTAG mandatory instruction set, 
82489DX also provides the INTEST instruction 
which allows static testing of the on-chip logic. 


The detailed information related to the IEEE Std 
1149.1-1990 (the JTAG standard) can be obtained 
from the reference document /EEE Standard Test 
Access Port and Boundary Scan Architecture (IEEE 
Std 1149.1-—1990). 


10.1 Boundary Scan Architecture 


The boundary scan logic contains the following ele- 
ments: 


— Five Test Access Ports (TAP): They are la- 
beled as trst, tck, tdi, tdo and tms. All ports are 
input pins except tdo, which is a tri-state output 
pin. 

— A TAP Controller: The logic is used to control 
the boundary scan activity. 


— 82489DX Device ID Register: This is a 32-bit 
read-only register. The DID can be shifted out in 
ascending order to the tdo pin. 


— JTAG Instruction Register (IR): This is a 4-bit 
register which accepts instruction code shifted in 
from the tdi pin. The opcode stored in the IR reg- 
ister is used to control operation. 


— Boundary Scan Register: This is a 137 stages 
scan path which connects almost all 82489DX 
signal pins for boundary scan purposes. 


— Bypass Register: This register simply allows the 
data which goes into tdi pin to be shifted out di- 
rectly from tdo. 


The following block diagram illustrates the imple- 


mentation of the JTAG architecture in the 82489DX 
design. 
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Figure 27. Block Diagram of the JTAG Architecture 


Test Access Ports 


trst TAP controller master reset pin. When trst is 
low, the TAP controller’s state machine will be 
reset to ‘‘test-logic-reset’’ state asynchronous- 
ly. This pin is tied to a weak internal pull-up for 
keeping to be a logical 1 when not driven. 


tck This is the test logic clock. The test logic will 
change state on the rising edge of the tck. 


tdi Test data input. Data is shifted into the tdi pin 
on the rising edge of tck. This pin is tied to a 
weak internal pull-up for keeping it to be a logi- 
cal 1 when not driven. | 


tms Test mode select. This pin is used to select the 
state of the TAP controller. This pin is synchro- 
nous to the rising edge of the tck. This pin is 
tied to a weak internal pull-up for keeping it to 
be a logical 1 when not driven. 
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tdo Test data output. This is a tri-state pin which 
allows the data to be shifted out. 


TAP CONTROLLER 


The TAP controller in 82489DX is implemented to 
conform the IEEE1149.1 standard. The TAP control- 
ler is a single phase clock, synchronous finite state 
machine. It controls the sequence of the operation 
of the test logic. 


The value of the test mode state (tms) pin at a rising 
edge of tck controls the sequence of the state 
changes. The state diagram for the TAP controller is 
shown in Figure 28. Test designers must consider 
the operation of the state machine in order to have 
the correct sequence of value to drive on tms. 


The behavior of the TAP controller and other test 


logic in each of the controller states is briefly de- 
scribed as follows. 
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Figure 28. TAP Controller State Diagram 


TEST-LOGIC-RESET 


The test logic is disabled so that normal operation of 
the on-chip system logic (i.e., in response to stimuli 
received through the system pins only) can continue 
unhindered. This is achieved by initializing the in- 
struction register to contain the IDCODE instruction. 
No matter what the original state of the controller, it 
will enter Test-Logic-Reset when tms is held high for 
at least five rising edges of tck. The controller re- 
mains in this state while tms is high. 


If the controller should leave the Test-Logic-Reset 
controller state as a result of an erroneous low sig- 
nal on the tms line at the time of the rising edge on 
tck (for example, a glitch due to external interfer- 
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ence), it will return to the Test-Logic-Reset state fol- 
lowing three rising edges of tck with the tms line at 
the intended high logic level. The operation of the 
test logic is such that no disturbance is caused to 
on-chip system logic operation as the results of such 
an error. On leaving the Test-Logic-Reset controller 
state, the controller moves into the Run-Test/Idle 
controller state where no action will occur because 
the current instruction has been set to select opera- 
tion of the device identification register. The test log- - 
ic is also inactive in the Select-DR-Scan and Select- 
IR-Scan controller states. 


Note that the TAP controller will also be forced to 


the Test-Logic-Reset controller state asynchronous- 
ly by applying a low logic level at trst. 
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RUN-TEST/IDLE 


A controller state between scan operations. Once 
entered, the controller will remain in the Run-Test/ 
Idle state as long as tms is held low. When tms is 
high and a rising edge is applied at tck, the controller 
moves to the Select-DR-Scan state. 


In the Run-Test/Idle controller state, activity in se- 
lected test logic occurs only when certain instruc- 
tions are present such as RUNBIST. Since 82489DX 
does not have RUNBIST instruction, this state is act- 
ing like an idle state. 


The instruction does not change while the TAP con- 
troller is in this state. 


SELECT-DR-SCAN 


This is a temporary controller state in which all test 
data register (82489DX has one test data register 
which is the boundary scan shift registers path) se- 
lected by the current instruction retain their previous 
state. | 


If tms is held low and a rising edge is applied to tck 
when the controller is in this state, then the control- 
ler moves into the Capture-DR state and a scan se- 
quence for the selected test data register is initiated. 
If tms is held high and a rising edge is applied to tck, 
the controller moves on to the Select-IR-Scan state. 


The instruction does not change while the TAP con- 
troller is in this state. 


SELECT-IR-SCAN 


This is a temporary controller state in which all test 
data registers selected by the current instructing re- 
tain their previous state. 


If tms is held low and a rising edge is applied to tck 
when the controller is in this state, then the control- 
ler moves into the Capture-IR state and a scan se- 
quence for the instruction register is initiated. If tms 
is held high and a rising edge is applied to tck, the 
controller returns to the Test-Logic-Reset state. The 
instruction does not change while the TAP con- 
troller Is In this state 


CAPTURE-DR 


In this controller state data may be parallel-loaded 
into test data registers selected by the current in- 
struction on the rising edge of tck. If a test data reg- 
ister selected by the current instruction does not 
have a parallel input, or if capturing is not required 
for the selected test, then the register retains its pre- 
vious state unchanged. The instruction does not 
change while the TAP controller is in this state. 
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When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 


Exit 1-DR state if tms is held at 1 or the Shift-DR 
state if tms is held at 0. 


SHIFT-DR 


In this controller state, the test data register con- 
nected between tdi and tdo as a result of the current 


instruction shifts data one stage towards its serial 


output on each rising edge of tck. Test data registers 
that are selected by the current instruction, but are 
not placed in the serial path, retain their previous 
state unchanged. The instruction does not 
change while the TAP controller is In this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit 1-DR state if tms is held at 1 or remains in the 
Shift-DR state if tms is held at 0. 


EXIT 1-DR 


This is a temporary controller state, if tms is held 
high, a rising edge applied to tck while in this state 
causes the controller to enter the Update-DR state, 
which terminates the scanning process. If tms is 
held low and a rising edge is applied to tck, the con- 
troller enters the Pause-DR state. All test date reg- 
isters selected by the current instructions retain 
their previous state unchanged. 


The instruction does not change while the TAP con- 
troller is in this state. 


PAUSE-DR 


This controller state allows shifting of the test data 
register in the serial path between tdi and tdo to be 
temporarily halted. All test data registers selected by 
the current instruction retain their previous state un- 
changed. 


The controller remains in this state while tms is low. 
When tms goes high and a rising edge is applied to 
tck, the controller moves on to the Exit 2-DR state. 
The instruction does not change while the TAP con- 
troller is in this state. 


EXIT 2-DR 


This is a temporary controller state. If tms is held 
high and a rising edge is applied to tck while in this 
state, the scanning process terminates and the TAP 
controller enters the Update-DR controller state. If 
tms is held low and a rising edge is applied to tck, 
the controller enters the Shift-DR state. 


ADVANCE INFORMATION : 


| | 

intel. 
All test data registers selected by the current in- 
struction retain their previous state unchanged. The 


instruction does not change while the TAP controller 
is in this state. 


UPDATE-DR 


Some test date registers may be provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in the associated 
shift-register path in response to certain instructions 
(e.g., EXTENT, INTEST, and RUNBIST). Data is 
latched onto the parallel output of these test data 
registers from the shift-register path on the falling 
edge of tck in the Update-DR controller state. The 
data held at the latched parallel output should not 
change other than in this controller state unless op- 
eration during the execution of a self test is required 
(e.g., during the Run-Test/Idle controller state in re- 
sponse to a design-specific public instruction). 


All shift-register stages in test data registers select- 
ed by the current instruction retain their previous 
state unchanged. The instruction does not 
change while the TAP controller is in this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Select-DR-Scan state if tms is held at 1 or the Run- 
Test/Idle state if tms is held at 0. 


CAPTURE-IR 


In this controller state the shift-register contained in 
the instruction register loads a pattern of fixed logic 
values on the rising edge of tck. In addition, design- 
specific data may be loaded into shift-register stages 
that are not required to be set to fixed values. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state. 


When the TAP controller is in this state and rising 
edge is applied to tck, the controller enters either the 
Exit 1-IR state tms is held at 1 or the Shift-IR state if 
tms is held at 0. 


SHIFT-IR 
In this controller state the shift-register contained in 
the instruction register is connected between tdi and 


tdo and shifts data one stage towards its serial out- 
put on each rising edge of tck. 
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Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state. 
When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit1-IR state if tms is held at 1 or remains in Shift-IR 
state if tms is held at 0. 


EXIT 1-IR 


This is a temporary controller state. If tms is held 
high, a rising edge applied to tck while in this state 
causes the controller to enter the Update-IR state, 
which terminates the scanning process. If tms is 
held low and a rising edge is applied to tck, the con- 
troller enters the Pause-iR state. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not changes while the TAP controller is in this state 
and the instruction register retains its state. 


PAUSE-IR_ 


This controller state allows shifting of the instruction 
register to be halted temporarily. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state 
and the instruction register retains its state. 


The controller remains in this state while tms is low. 
When tms goes high and a rising edge is applied to 
tck, the controller moves on to the Exit 2-IR state. 


EXIT 2-IR 


This is a temporary controller state. If tms is held 
high and a rising edge is applied to tck while in this 
state, termination of the scanning process results, 
and the TAP controller enters the Update-iR control- 
ler state. If tms is held low and a rising edge is ap- 
plied to tck, the controller enters the Shift-IR state. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state 
and the instruction register retains its state. 


UPDATE-IR 
The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 


ter path on the falling edge of tck in this controller 
state. Once the new instruction has been latched, it 
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becomes the current instruction. Test data regis- 
ters selected by the current instruction retain 
their previous state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters the Se- 
lect-DR-Scan states if tms is held at 1 or the Run- 
Test/Idle state if tms is held at 0. 


INSTRUCTION REGISTER 


The function of the instruction register is to select 
the operating mode of the test logic. For instance, 
read the ID register, or capture the 82489DX output 
signals. 82489DX has implemented 4 instructions. 


__Instruction | Mandatory/Optional | Opcode | 
bypass | Bi AT 
extest | om | 0.000 

sample/preload Raia: eas se 

i Siac A BEE 
eee aes 


idcode 


reserved 


Bypass Instruction 


The bypass instruction selects the bypass register to 
_ be connected to tdi and tdo, effectively bypassing 
the test logic on the 82489DX boundary scan path 
and reducing the shift length to be on one bit. Note 
that an open circuit fault in the board level test data 
path will cause the bypass register to be selected 
following an instruction scan cycle due to the inter- 
nal pull-up on the tdi pin. This has been done to 
prevent any unwanted interference with the proper 
operation of the system logic. 


Extest Instruction 


The extest instruction allows testing of circuitry ex- 
ternal to the component package, typically board in- 
terconnects. It does so by driving the values loaded 
into the 82489DxX’s boundary scan register out on 
the output pins corresponding to each boundary 
scan cell and capturing the values on 82489DX’s 
input pins to be loaded into their corresponding 
boundary scan register locations. |/O pins are se- 
lected as input or output depending on the value 
located into the output control cell. Values shifted 
into input latch in the boundary scan register are 
never used by the internal logic of the 82489DX. 
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82489DX must be reset after extest instruction has 
been executed. 


Sample/Preload Instruction 


The sample/preload instruction has two functions 
that it can perform. When the TAP controller is in the 
CAPTURE-DR state, the sample/preload instruction 
allows a snap-shot of the normal operation of the 
82489DxX without interfering with that normal opera- 
tion. The instruction causes boundary scan register 
cells associated with outputs to sample the value 
being driven into the 82489DX. On both outputs and 
inputs the sampling occurs on the rising edge of tck. 
When preloads data into the 82489DX pins to be 
driven to the board by executing the extest instruc- 
tion. Data is preloaded to the pins from the boundary 
scan register on the falling edge of tck. 


dcode Instruction 


The idcode instruction selects the device identifica- 
tion register to be connected to tdi and tdo, allowing 
the device ID code to be shifted out of the device on 
tdo. Note that the bit stream shifted into tdi will ap- 
pear on tdo after all 32 bits of the DID has been 
shifted out. 


DEVICE IDENTIFICATION REGISTER (DID) 


The device identification is a 32 bits number which 
can be read by the external hardware by using the 
idcode instruction. The 82489DX device ID is as- 
signed to 1489A013 (hex). This is subject to change. 
The upper 4 bits of DID may be changed for different 
version. The 16-bit number (bit 27-bit 12) 489A 
(hex) is the part ID. The lower 12 bits are the manu- 
facturer ID for Intel which must be 013 (hex). 


BOUNDARY SCAN REGISTER 


82489DX has only one test data register, i.e., the 
boundary scan register. The boundary scan register 
is a single shift register path containing the boundary 
scan cells that are connected to all signal input and 
output pins of the 82489DxX. There are three generic 
type of boundary scan cells—input, output, and bi-di- 
rectional. For each input only cell, one stage of shift 
register is added to the boundary scan path. 


ADVANCE INFORMATION 


% . A 
intel | | 82489DX 
® 
All output pins will become tri-stateable when put driver with a specific tri-state control cell in the 
boundary scan is activated, regardless whether they scan path. The user must shift in a proper control 
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Figure 29. Logical Structure of Boundary Scan Register 
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BOUNDARY SCAN CELL NAMES IN ORDER 
FROM tdi TO tdo 


The following table is a list of the boundary scan cell 
names in the order from tdi to tdo. The type informa- 
tion indicates the purpose of the cells. 


| = input only cell 

B = bi-directional cell 

T = tri-state output cell 

C = tri-state control. Note that the signal name 


enclosed within the parenthesis is con- 


trolled by this cell. 


fe) 


6 


SB. | 
| 59 
| 60 
ee ee 
| 62 
ry 8 
eee 
| 5 
ci | 
0 ‘e4 
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es 
ae | 
cree 
eS 
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fe 
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Peed 
poaea 


io) 


a 


NTN TN ITN 
oO;|aoi in| oD 


reserved 


INTIN15 


reserved 


(esorved 


2-114 


[eenumber | Type [Name [ Pine 
er ee 
enti [as 
[st | record | 
[sc _| eserves | 
ee 


ie) 
£ 


reserved 


+ 


(reserved) 


INTINS 88 


<=] 
=! 
0) 
” 
© 

i 
£0) 
ok 


—s 


INTING 


reserved 


N 


© 
n 
© 
i [ 
© 
& 


aN 


IN 
o 
Zz 
= 
i 
5 


t 
(reserved) 


aS 
o>) 


Pifl Hi Hl fl w | w& 
ié) 2) N | O® 


5 a ees 
Orn 


Poa 
oie 
coed 
peo 
Cor, | 
coated 
Pagk D 
PINTINZ | 90 
Sat 3] 
Fema 
pAbe 
1 es 
| 93 
eal 
eee 
oS 


ER 
To _| tesoned) | 
Cintiny [86 
Ter 
RTE 


oO 
es 


INTINO 97 


N 


—_, 


SPARS 
P02 4 
| 103 
| 104 
T | reserved | 
(reserved) Pay 

| 105 

meer 


o 
& 


re») On n 
shea bibar bee 
>) 


ADVANCE INFORMATION 


ove ‘ pena eral - ea? a8 ‘ y 


82489DX 


it 


Ooo Oe Os) Pia Os. 1 wo 
a ea, eee ee eS ae ea ee 
ee) eee a a ee ee ee 


Det Hi cel I ee OE el 


™m 


2-115 


: ADVANCE INFORMATION 


82489DX 


BYPASS REGISTER 


The bypass register is simply a 1-bit shift register 
which connects between the tdi and tdo. When se- 
lected by using the bypass instruction, the data shift- 
ed into tdi will be shifted out from tdo one tck clock 
later. 


JTAG TAP Controller Initialization 


The TAP controller must be reset to test-logic-reset 
state when 82489Dx< is first powered up. There are 
two ways to reset the TAP controller: 


1. Assert trst to be 0, it will reset the TAP controller 
asynchronously. 


2-116 


B 
intel : 
2. Assert tms to be 1, and clock the TAP controller 


at least five times, the TAP controller will be reset 
after the fifth rising edge of the tck. 


After reset, the idcode instruction is loaded into the 
IR automatically. 


Note that the tms and trst pins both have an internal 
weak pull-up device to keep them to be logic 1 level. 
Therefore the user can simply apply 5 clocks at the 
tck input to reset the TAP controller. If the TAP con- 
troller is not reset properly, 82489DX may not func- 
tion because the boundary scan logic might be ac- 
tive which will impact the signals flow in and out to 
the chip. 
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11.0 ELECTRICAL 
CHARACTERISTICS 


11.1 D.C. Specifications 


ABSOLUTE MAXIMUM RATINGS 
Case Temperature Under Bias... —65°C to + 110°C 


Storage Temperature .......... —65°C to + 150°C 
Voltage on Any Pin 
with Respect to Ground...... —0.5 to Voc + 0.5 


[Symbol [Parameter 


Output Leakage Current 


[Output Leakage Curent 
NOTES: 


1. This parameter is measured with current load of 4 mA. 
2. This parameter is measured with current load of 1.0 mA. 
3. This parameter is for output without pulldown. 


Voc = 5V 5%; To = 0°C to + 85°C 


82489DX 


NOTICE: This data sheet contains information on 
products in the sampling and initial production phases 
of development. The specifications are subject to 


change without notice. Verify with your local Intel 
Sales office that you have the latest data sheet be- 
fore finalizing a design. 


*WARNING: Stressing the device beyond the “Absolute 
Maximum Ratings” may cause permanent damage. 
These are stress ratings only. Operation beyond the 
“Operating Conditions” is not recommended and ex- 
tended exposure beyond the “Operating Conditions” 
may affect device reliability. 


pv (Note) | 
pv | (Note) 
| mA 


‘anaes Teer Bre 
aA LH yay 9) 
ci 
mere 


| Max | Units 
eT Ses Eee aS. 
Oia ee Mae ee Rl 
See es ae 

V 


pA (Note 4) 
pA (Note 3) 


ee Lis 
bes io) Ge ccs 
eee eee 
Bane eae 
fee Vege ie eer 
es Bee eee 


4. This parameter is for tri-state output with pulldown and Voy = 3.0V. 


5. This parameter is for input with pullup at Vi, = OV. 
6. ICC bus output low current is measured at 0.6V. 
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11.2 A.C. Specifications 


A.C. Parameters Referencing 33 MHz System Clock 
Voc = 5V +5%; Tc = O°C to + 85°C ; 


Parameter oie Load 
Fig. (pF) 
CLKIN Period 


CLKIN High Time 
CLKIN Low Time 
CLKIN Rise Time 


CLKIN Fall Time 
ADS, BGT, DLE, M/I0, D/C, W/R, CS 31 
Setup Time 
D31-D0, DP3-DPO, A9-A3 Setup Time 
ADS, BGT, DLE, M/10, D/C, W/R, CS 31 
Hold Time | 
D31-D0, DP3-DP0, A9-A3 Hold Time 
D31-D0, DP3-DPO, Valid Delay Eat 
D31-D0, DP3-—DPO0O, Low-Z Delay 32 
When DLE is Not Used 
D31-D0, DP3-DPO, High-Z Delay 
When DLE is Not Used 
D31-D0, DP3-DPO Enable Delay 
When DLE is Used 
D31-D0, DP3-DP0 Disable Delay 
When DLE is Used 
RDY Valid Delay S80. 
PRST, PNMI, PINT Valid Delay ae 7 
RESET Setup Time ar | 
RESET HoldTime er 


RESET Cycle Time 


INTIN[15:0], LINTIN[1:0] Low Time 


All parameters are given in nanoseconds. 
TTL Level timing is measured at 1.5V for both “0” and “1” levels. 


NOTES: 

1. 1CC bus clock ICLK period must be at least 5 ns longer than system clock CLKIN for proper synchronization of the 
internal asynchronous signals. 

2. System clock CLKIN measured from 0.8V —2.0V. 

3. Minimum Reset cycle is the greater of the two cycle times. 

4. Minimum pulse width must be met for valid level to be attained on the DATA or ADDRESS output. 

5. Set up and hold time is required for RESET to start at the next rising edge of the clock. 

6. INTIN and LINTIN low time is measured from 1.5V of the falling edge to 1.5V of rising edge. 

7. Not 100% tested. Guaranteed by design characterization. 
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Time Base A.C. Parameters 
Voc = 5V +5%; Tc = 0°C to + 85°C 


[Symbol | Parameter 
[mime | TWEASEPefod | as 
[eo | TWBASEigh Time [95 
[et] TWBASE LowTime [95 
5 Mee 
ar bes] 


TMBASE Rise Time 
TMBASE Fall Time 


Combo | Paramewe 
[to | ToKghTme 
[is | TOKFaltime 
ie 
rise 
rise 


TRST Minimum Low Time 
All parameters are given in nanoseconds. 


TTL level timing is measured at 1.5V for both “O” and “1” levels. 


NOTES: 
1. These parameters are specified for 50 pF load. 
2. This parameter is measured at 1.5V between the rising and falling edges. 
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A.C. Parameters for ICC Bus 
Voc = 5V +5%; Tc = 0°C to + 85°C 


ICLK Low Time 
ICLK Rise Time 


All parameters are given in nanoseconds. 


ICLK Period 
ICLK High Time 


symbol _ 
Pet 


TTL level timing is measured at 1.5V for both ‘‘O” and ‘‘1” levels. 


NOTES: 


1. MBI3-0 and MBO3-0 timing is tested at 150 ns cycle time. 


2. This parameter is specified for 50 pF load. 
3. Not 100% tested. Guaranteed by design characterization. 


12.0 REGISTER SUMMARY 


82489DX registers can be located at any 1 Kbyte 
boundary in either memory or I/O space for as far as 
the 82489DxX architecture itself is concerned. From 
a platform standard point of view, it is recommended 
to locate all 82489DX Local Units in memory space 
at address OxFEEO—OO0O. It is further recommend- 
ed that all 82489DX I/O Units also be located in 
memory space; I/O Unit 1 at address 0xFECO— 
0000, I/O Unit 2 (if present) at address OxFECO— 
1000, and so on. Chip select for the 82489DX 
should be based on a full decode of address pins 
A31-A10. 


All directly accessible 82489DxX registers are 32 bits 
wide and are aligned at 128-bit boundaries. The reg- 
ister being accessed is determined by bits 4 through 
9 of the address. This is listed in the tables below. 
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Addresses not listed are reserved by the architec- 
ture. The tables also show whether the register is 
readable and/or writable by software, and what the 
side effects are of software accessing the register. 


After reset, all registers are initialized to all zeroes 
with the following exceptions, The Local Unit ID field 
is initialized with data present on the 8 LSB address 


pins. The Mask bit is initialized to 1 (‘“‘masked” state) 


in all entries in both the local vector table and the 
redirection table. 


For the |/O Unit, only the I/O register select and I/O 
window registers are directly accessible in the ad- 
dress space. The other I/O unit registers are ac- 
cessed indirectly through the select and window reg- 
ister. 
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I/O Unit Registers hee 


Register Address(9:4) | SW | __ SideEffects 
/0 Register Select 00 0000 pice ame ee A 


Register [Wo Reg Seiect(70) | SW | Side Effects 
[Version Register ——=«d(~—~—~=Co0 001i 

[Redirection Table f01(@10) | 000+0000 | ww 
[Redirection Table (0) (63:82) | 00010001 =| rw 
[Redirection Table 11 9%0) | oooroo10 =| rw 
[Redirection Table (116392) | _oooroon | aw 
[Redirection Table [21(01:0) | oooro100 «| rw 


: ager gpee 
3 ae ae 
, we 
: mer ee 
: NT 
[Redirection Table (6) [91:0] | o0ors010. «|S w 
[Redirection Table [61 (65:02) [oor to11 Sw 
[Redirection Table (611i) | 0001 100 «iY rw 
Redirection Table (616992) | ooo n01——«d| rw 
[Redirection Table 7Ifor0 | _ooor10———iY Sw 
Redirection Table 716092) [ois iw 
Redirection Tabi [6] (91:0) | 00100000 «| rw 
Redirection Table [6] 63:92) | _o0100001 =| ow 
[Redirection Table [9] [a1.00) | 00100010 «| rw 
Redirection Table [9] (65:02) | _on100011 «iw 
Redirection Table [101 61:00) | _oo100100 «| rw 
[Redirection Table [101 63:92) | oo100109 «| w 
[Redirection Table [111 (910) | _oo100110 <i S rw 
[Redirection Table [111 (69:92) | _oooonns iw 
[Redirection Table 121 (61:0) | 00101000 =| rw 
[Redirection Table [121 69:92) | _oo101001 «| w 
Redirection Table [1a] 10) | _oov01010 SiS 
Redirection Table [1a] [69:92] | _oo1o1011 «dw 
[Redirection Table 14] (61:0) | _oo101100 «iY rw 
Redirection Table [14] 69:92) | _oo1oni01—sY ew 
[Redirection Tabieis}(3t0] | 0010110 


Redirection Table [15] [63:32] 0010 1111 fi 2 en re 
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LOCAL UNIT REGISTERS 


Address (9:4) 
Local Unit ID Register 00 0010 Reis we: [peas 


Soe os | a 
Cee | eee 
or ae ee a SG 
aed eee 
Eo Renin Regie [ofa Le ae ee 
[Logie Destnaton Reg [oot | wf 
[Deetination FommatReg. | 001110 «| ow 
[spurtous VeotorRegistes | oovmi1 | ow 
mg yee Pe 
a 
TUT A SR TE 
eT 
[sr vier160) itor 
ae. 
cares. 
pera 
manna 
earrE 
3 aR 
: rape 
3 er 


, 


ISR (223:192) 010110 


re 
[Twn ess2y id 
au.) aR ES PTR 
omaeeda | ew 
Te 
Ten is0:12 =i ooo 
1 eagsia o>] ee 
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LOCAL UNIT REGISTERS (Continued) 


NOTE: 
Address space 101000 to 101111 and 111111 are reserved 


13.0 TIMING DIAGRAMS 


ty 1+ tao, toy 


Output Signal 


290446-20 


Figure 30. Output Waveform 
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13.0 TIMING DIAGRAMS (Continued) 


Input Signal 
290446-21 


Figure 31. Input Waveforms 


D[3 1:0], DP[3:0] 


—— em scaemesener eesmyeas mp wesw esl, 
RDY 


290446-22 


Figure 32. Data Bus Tri-State Delays when DLE Sampled Low with ADS 
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13.0 TIMING DIAGRAMS (Continued) 


DLE 


D[ 31:0], DP[3:0] 
290446-23 


Output Signal, TDO 


290446-24 


Figure 34. TAP Signal Timings 
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13.0 TIMING DIAGRAMS (Continued) 


t39,l42, 
(52 


(30, 40» 50 (31, (4), t51 


wee ra 


TMBASE 


tme> ice ic 
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Figure 35. TMBASE, ICLK, TCK Timing 
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Figure 36. ICC BUS Open-Drain Output Delay 
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14.0 PACKAGE PIN-OUT 


132-Lead PQFP 
Package Type KU (See Packaging Specification. Order Number 240800) 


132 E> Vssp 


Yss 

Voc 
INTINO 
INTIN1 
INTIN2 
INTIN3 
INTIN4 
INTINS 
INTIN6 
INTIN7 
INTINS 
INTINS 
INTIN1O 
INTIN 11 
INTIN12 
INTIN13 
INTIN14 
INTIN1TS 
LINTINO 
LINTIN 1 
MB10 
MB11 
MB12 
MB13 
Reserved 
cs 

DLE 
Reserved 
Reserved 
Reserved 
Voc 

Vss 

Vss 


COON AUNAWD = 


TOP VIEW 
132 —-LEAD PQFP 


Reserved 
Reserved 


Vsspo C4 36 
Vecpo C—4 39 
Vsspo C4 40 


Reserved GC.wJd 34 
EXINTA Cg 41 
Reserved CuwJj 42 
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NOTE: 
See pin description section for appropriate pin-strapping of the reserved pins. 
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15.0 PACKAGE THERMAL any environment, to determine whether the device is 
SPECIFICATION within the specified operating range. 


The 82489DX is specified for operation when the | The PQFP case temperature should be measured at 
case temperature is within the range of 0°C to _ the center of the top surface opposite the pins, as 
+ 85°C. The case temperature may be measured in shown in Figure below. 


MEASURE PQFP CASE TEMPERATURE 
AT CENTER OF TOP SURFACE 


290446-26 


Plastic Quad Flat Pack (PQFP) 


PQFP Package Thermal Characteristics 


Thermal Resistance—°C/W 


6 Junction to Ambient 


| 0 | 200 | 400 | 600 | 00 | 1000 
. ClineNenio Gand 8) 2 ae | Ta Lg 898 Be 8] 


NOTES: 
1. Table above applies to 82489DX PQFP plugged into a socket or soldered directly into the board. 
2. Oya = Ojo + Oca. 
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16.0 GUIDELINES FOR 82489DX 
USERS 


16.1 Initialization 


This section outlines one possible initialization sce- 
nario. Other scenarios are certainly possible, and 
one would be selected as part of a platform stan- 
dard initialization scheme. The intent of this section 
is to illustrate that the initialization support provided 
by the 82489DX is adequate to support MP (Multi- 
processor) system initialization. 


Each 82489DX has a RESET input pin connected to 
a common Reset line. Upon system reset, this com- 
mon reset line is activated, causing all the 
82489DXs to go through reset. All 82489DX local 
units (note: only local units and not !/O units) latch 
their ID from their address bus on reset. The ID can 
be provided by the bus control agent based on slot 
number. 


The local units next assert their processor’s Reset 
pin, holding the processor in reset, and next perform 
their internal reset, setting all registers to their initial 
state. The initial state of all 82489DX Units (both 
local and |/O units) is ‘all masks set” and all Local 
Units disabled; registers are otherwise initialized to 
zero. Note that the PINT and PNMI output pins are in 
tri-state mode when the local unit is disabled. After 
this, each 82489DxX local unit will deassert its proc- 
essor’s Reset pin, allowing the processors to come 
out of reset and perform self test and start executing 
initialization code. 


Note that while connecting PRST pin it should be 
noted that whenever PRST pin is activated by 
82489DxX either because of software reset message 
or hardware reset, the 82489DxX itself is reset. It 
should be taken care in the cases of Warm reset 
where only processors need to be reset and not the 
interrupt controller. In brief, the usage of PRST de- 
pends upon the system requirement on various re- 
set. 


Somewhere in this code sequence, the processors 
that are “alive” will enable their 82489DxX local units, 
and attempt to force all the other processors back 
into Reset. Forcing the other processors into reset is 
performed by sending them the inter-processor in- 
terrupt with Destination Mode = ‘Physical’, Deliv- 
ery Mode = ‘Reset’, Trigger Mode = “Level”, 
Level = “1”, and Destination Shorthand = “All Excl 
Self’. Only the first processor to get the ICC bus will 
succeed in sending this signal and reset all other 
82489DxXs and their processors. The other proces- 
sors are kept in reset until such time that an MP 
operating system decides they can become active 
again. The only running processor next performs the 
rest of system initialization. 
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Eventually, an MP operating system will be booted 
at which time the operating system would send 
“deassert reset’ interprocessor signals to activate 


_ the other processors in the system. A mechanism 


must be provided by the platform that allows the 
added processors to differentiate the very first reset 
from a subsequent one. 


16.2 Compatibility 
COMPATIBILITY LEVELS 


The 82489DX can be used in conjunction with stan- 
dard 8259A-style interrupt controllers to provide a 
range of compatibility levels. 


At the lowest level we have “PC shrink-wrap” com- 
patibility. This level effectively creates a uniproces- 
sor hardware environment within the MP platform 
capable of booting/running DOS shrinkwrap soft- 
ware. In this mode, only the 8259A generates inter- 
rupts and the 82489DX becomes a virtual wire. The 
interrupt latency can be minimized by connecting the 
8259A interrupt to local unit directly. 


The next level preserves the software compatible 
view of an 8259A but it allows more than one proc- 
essor to be active in the system. This results in an 
asymmetrical arrangement, with one processor field- 
ing all 8259A interrupts but with added inter-proces- 
sor interrupt capability. In this mode, 82489DX 
“merges” 8259A interrupts with inter-processor in- 
terrupts. Existing |/O drivers would be bound to the 
compatible CPU and interface directly with the 
8259A. 


At the next compatibility level, 8259A compatible 
drivers can be mixed with native 82489DxX drivers. 
Devices can generate interrupts at either 8259A or 
an 82489DX. This provides for partial symmetry as 
individual drivers migrate from the 8259A to native 
82489DxXs. 


Another 8259A compatible point can be defined for 
MP systems. Each processor could have its own 
compatible 8259A controllers, allowing multiple 
processors to run compatible |/O drivers, but stati- 
cally spreading the load across the available proces- 
sors. 


82489DX/8259A INTERACTION 


The principle of compatible operation is very 
straightforward; the 82489DX(s) become a virtual 
wire connecting the 8259A’s INT output through to 
the processor, while at the same time making 8259A 
visible to the processor. 
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The two connection schemes described only differ 
in the number of 82489DX(s) (one or two) that are 
located in the path from the 8259A to the processor. 
In the one 82489DX example illustrated in Figure 37, 
the INT output of the 8259A connects to one of the 
Interrupt Input pins of the 82489DX through an edge 
generation logic. This could be an interrupt pin on 
the 82489DX’s I/O unit or local unit; assume a local 
interrupt input is used. The Local Vector Table entry 
for the interrupt pin that connects to the 8259A is set 
up with a Delivery Mode of “ExtINT” and edge trig- 
ger mode. This indicates that the interrupt is gener- 
ated by an external controller. The processor’s INT 
pin connects to the 82489DX PINT pin. 


This setup enables the 82489DxX local unit to detect 
assertions (up-edges) of the 8259A’s INT output pin 
and pass this on to the processor’s INT input. 
82489DX asserts ExtINTA pin along with (one clock 
prior to) PINT pin to indicate “8259” interrupt. When 
the processor performs its INTA cycle the 82489DX 
itself does not respond other than deasserting PINT 
to the processor. At the third clock after ADS in the 
second bus cycle of INTA cycle ExtINTA is deassert- 
ed. External logic should make use of the ExtINTA 
signal to make the INTA cycle visible to the 8259A 
and the 8259A should provide the vector. At the 
same time, the local unit considers the external re- 
quest as delivered, and need not wait for the exter- 
nal 8259A’s INT to be deasserted. A new up-edge 
must be generated on the 8259A INT pin before the 
local unit will assert the processor’s INT pin on be- 
half of the 8259A. External edge generation logic 
should be used for this. Compatible software inter- 
acts directly with the 8259A. 


The mechanism is essentially the same in the two- 
82489DX scheme. The difference is that the 8259A 
connects to an interrupt input pin of the 82489DX 
I/O unit in the 1/O system. The Redirection Table 
entry for this pin is again programmed with an 
“ExtINT” Delivery Mode, and the (single) 82489DX 
destination local ID corresponding to the compatible 
DOS processor. Capturing the up-edges of the 
8259A’s INT pin by the 82489DxX local unit now in- 
volves sending messages from the 82489DX I/O 
unit to the 82489DxX local unit via the ICC bus. The 
“virtual wire” now includes messages over the ICC 
bus. | 


Adding inter-processor ICC interrupts (or any other 
82489DX generated interrupts) to the compatible 
operation is accomplished by having the 82489DX 
internally OR the 8259A’s INT request with any 
82489DxX interrupt request. 


Before the 82489DxX actually sends the interrupt sig- 
nal to the processor, the 82489DX decides whether 
it does this for an 82489DX interrupt or whether it 
does this on behalf of the external controller. When 
the processor performs the corresponding INTA cy- 
cle, only the 82489DX knows whether it should re- 
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spond with a vector, or whether the external 8259A 
should. 


lf the 82489DX needs to respond, then it will enable 
an externally implemented trap that prevents the 
8259A from seeing the INTA cycle. If the 8259A 
needs to respond, then the 82489DxX will not enable 
the INTA trap, and the INTA will be allowed to reach 
the 8259A. 82489DX implements this by asserting 
its EXTINTA pin to indicate external 8259A should 
respond with the vector. The 82489DX local unit 
controls the INTA trap via its “ExtINTA”’ output pin; 
the 82489DX does not actually provide the trap it- 
self. 


MP "SYSTEM" BUS 


PROCESSOR 


Block 


“ 
2 
“ 
© 
3 
> 
© 
a 
~ 
a 
3 
~~ 
- 
6 
= 
= 


8259A 
Equivalent 


1/0 EXPANSION BUS 
290446-9 


Figure 37. Edge Logic 


82489DX/8259A DUAL MODE CONNECTION 


In systems that can be booted either as a configura-. 
tion with compatible 8259A or without, device inter- 
rupt lines are connected to both the Interrupt Re- 
quest pins of the 8259A and Interrupt Input pins of 
the 82489DxX with all interrupts either masked at the 
82489DX or at the 8259A. Some EISA and Micro- 
Channel chip sets that include on-chip 8259As also 
have internally connected interrupt requests. For ex- 
ample, the 82357 (the ISP of the EISA chipset) gen- 
erates timer and DMA chaining interrupts internally. 
These are not available as separate interrupts out- 
side the ISP. In non compatible mode the ISP timers 
are not used, since each local 82489DX unit pro- 
vides its own timer. Therefore, the ISP’s 8259A is 
configured to mask out all interrupts except the DMA 
chaining interrupt which is configured in level-sensi- 
tive, auto EOI mode. This causes the 8259A’s INT 
output to track the state of the internal DMA inter- 
rupt request. The 8259A’s INT output is then con- 
nected to one of the 82489DxX interrupt input pins 
programmed to generate a regular (i.e., not 
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“ExtINT’”’) level-sensitive interrupt. The ISP 8259A 
then no longer functions as an external interrupt 
controller; it has been logically disabled, and it 
needs no interrupt acknowledge or EOI. The INTA 
and EOI cycles occur only at the 82489Dx. It should 
be noted that 82489DX accepts only active high lev- 
el/edge interrupt inputs. External programmable log- 


ic should take care of polarity reversal that may be 
needed in EISA system for sharing of interrupts. 


16.3 Hardware Guidelines 


82489DX HARDWARE STATE ON RESET 


The 82489DX goes to reset state either by Hard- 
ware Reset state or Software Reset message re- 
ceived on the ICC bus. On reset, 82489DxX is dis- 
abled. The following is the hardware state of 
82489DxX after reset. 


PRST | Active (HIGH) 


| PNMI | TRI-STATED (Internal Pull-Down Provided) 


PINT | TRI-STATED (Internal Pull-Down Provided) 


82489DxX is disabled on Reset and unless specifical- 
ly enabled, it does not start its interrupt mechanism. 


The difference between hardware reset and soft- 
ware reset. message is that during hardware reset 
82489DX samples the address bus and stores the 
last sample in Local Unit ID whereas for software 
reset it does not sample and store the unit ID. In 
addition, during the hardware reset pulse should be 
wide enough to accommodate for at least one rising 
and falling edge of ICLK. On hardware reset ExtiN- 
TA is held high. 


PULL UP AND PULL DOWN RESISTORS 


PNMI, PINT are tri-stated at power on and they are 
maintained in tri-state condition till the unit is en- 
abled. Eventhough internal pull down resistor is pro- 
vided on PNMI and PINT external additional pull 
down resistor may be needed depending upon the 
loading on these pins by external logic. The DC 
characteristics gives the control specification from 
which the value of resistor, if needed, can be calcu- 
lated. 


It should be kept in mind that the ICC bus being 
electrically open drain bus requires pull up resistors 
at the MBO pins. ICC bus output low current is just 
4 mA. 


PINT and ExtiINTA Timings 


It should be noted that for ExtINTA type of interrupts 
PINT gets activated one clock after ExtINTA gets 
activated. When getting deactivated, both PINT and 
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ExtINTA gets deactivated at the same clock edge. 


ExtINTA Timings 


In the interrupt acknowledge cycle for External Inter- 
rupt control, 82489DX asserts ExtINTA. It decodes 
the type of cycle from CPU control signals like M/IO, 
D/C and W/R. CPU does two bus cycles back to 
back for interrupt acknowledge cycle. 82489DX 
maintains ExtINTA active throughout the first cycle. 
For next cycle (when the vector will be given by ex- 
ternal 8259) after 82489DX senses the start of the 
cycle (by ADS) 82489DX deactivates ExtINTA. Ex- 
ternal control logic may be inserting wait states to 
match the 8259 timings. Since 82489DX has no way 
of finding out the cycle completion, 82489DX deas- 
serts ExtINTA before the second bus cycle gets 
completed. This should be kept in mind while using 
ExtINTA for external interrupt control logic. 


82489DX AND MEMORY MAPPING 


The 82489DxX is a 32-bit high performance interrupt 
controller. It allows the CPU to do 32-bit read and 
write to it. By memory mapping 82489DX the system 
performance can be enhanced. It should be noted 
that 82489DX does not support pipelining. Even- 
though 82489DX can be memory mapped, its func- 
tionality as an interrupt controller should be kept in 
mind while programming the virtual memory man- 
agement control data structure. The caching policy 
for the page where 82489DxX is mapped should also 
be done with the functionality of 82489DX in mind. 
For example, the reads to 82489DX should not be 
cached and writes should be write-through. Since 
82489DxX registers are aligned at 128-bit bounda- 
ries, memory mapping 82489DX with interleaved 
memory system should not be a problem. 


JTAG CIRCUIT CONSIDERATIONS 


The JTAG circuit is used for boundary scan test. The 
JTAG pins has a TCK, (JTAG clock), TRST, (JTAG 
Reset), TDI, (Test Data Input), TMS, (Test Mode Se- 
lect) and TDO, (Test Data Output). 


The JTAG circuitry, if not used, should be properly 
deactivated so that it will not interfere in the normal 
functional operations. The JTAG can be inactivated 
in any one of the following ways: 


1. JTAG inactivation through TRST: The TMS, Test 
mode Select should be either left open (internal 
pull up is provided) or tied to Voc. The TRST can 
be pulsed low (bring it low and after meeting the 
pulse width requirement bring it back high again) 
to keep the JTAG circuitry to idle state. The 
TRST pulse brings the JTAG circuitry to idle state 
and TMS being kept high maintains the JTAG cir- 
Ccuitry in idle state. 
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2. JTAG inactivation through TCK: The TMS, Test 
mode select should be either left open (internal 
pull up is provided) or tied to Voc. The TCK within 
5 clocks brings the JTAG circuitry to idle state. 
The TMS, being held at logic high level, maintains 
the JTAG circuitry in idle state. 


16.4 Programming Guidelines 


The 82489DxX register data structure contains differ- 
ent fields to specify the mode of operations and the 
options available within each mode. Since certain 
options are applicable to specific modes only (for 
example “Remote Read” mode applies only to inter- 
rupt command register, it does not have relevance 
to I/O unit’s redirection tables) the following pro- 
gramming guidelines are provided. 


UNIQUE ID REQUIREMENT | 


All the local units and I/O units hooked in a ICC bus 
should have unique ID before they can use the bus. 
This should be ensured by the programmer since for 
ICC bus arbitration the units (whether it is local unit 
or I/O unit) arbitrate with their unit ID. 


For future compatibility, the Units should be as- 
signed IDs starting with 0, 1, 2 etc. with the highest 
ID in the system being number of units minus 1. So 
in a four 82489DX system there are four local units 
and four I/O units. The ID starts with 0 and the high- 
est ID in the system will be 7. Note that each unit 
should have different ID in the system. 


ATOMIC WRITE READ TO TASK PRIORITY 
REGISTER 


Normally, the task priority register is written with 
highest priority to mask certain low level interrupts 
before entering into critical section code. In a sys- 
tem where 82489DX is memory mapped the CPU 
may buffer this task priority register write to its on 
chip write buffer. The following scenerio can happen 
in such situation: CPU posts task priority register 
write to its on chip write buffer and enters into the 
critical code. A lower priority interrupt (which should 
not enter the critical code) interrupts the CPU before 
the write buffer gets flushed into task priority regis- 
ter). The CPU accepts the lower priority interrupt. To 
avoid the situation atomic write read to task priority 
register should be done. The read following write en- 
sures that the write buffer is flushed to task priority 
register and the atomicity ensures that no interrupt 
will be accepted by the CPU during its write to task 
priority. 


It should be noted that if the CPU does interrupt 


acknowledge cycle only after flushing the write buff- 
ers then the above situation may not arise. 


2-132 


SPS aR rey ig Pas: 
mgt 
intel. 


CRITICAL REGIONS AND MUTUAL EXCLUSION 


Each 82489DX has a single Interrupt Command 
Register that it uses to send interrupts to other proc- 
essors. The programmer should make sure to syn- 
chronize access to this register. Specifically, 1). writ- 
ing all fields of the register, 2). Sending the interrupt 
message (by writing the LSB register), and 3). wait- 
ing for Delivery State to become Idle again, should 
occur as a single atomic operation. For example, if 
interrupt handlers are allowed to send inter-proces- 
sor interrupts, then interrupt dispensing to the proc- 
essor must be disabled for the duration of these ac- 
tivities. 


INTERRUPT COMMAND REGISTER 
PROGRAMMING SEQUENCE 


The interrupt command register (31:0) has a side 
effect of sending interrupt once it is written. The des- 
tination is provided in the interrupt command register 
(63:32). So always interrupt command register 
(63:32) should be programmed before programming 
interrupt command register(31:0). 


Program Interrupt Command Register (63:32) 


J 


Program Interrupt Command Register (31:0) 


INTERRUPT VECTOR 


Two different interrupts should not be programmed 
with the same interrupt vector. 


LOCAL AND 1/0 UNIT 


Only Interrupt command register supports “Remote 
Read” Delivery mode. Local and |/O unit interrupts 
do not support “Remote Read’”’. 


ICR (INTERRUPT COMMAND REGISTER) 
1. ExtINTA delivery mode is not supported for all 
destination shorthands. 


2. “Remote Read” should always be programmed 
as “‘Edge’”’ triggered interrupt. 


3. “Remote Read” should always be programmed 
with physical destination mode (and not with Log- 
ical Destination mode). Broadcast addressing 
should not be used for Remote Read. 


4. For “all incl Self’ and ‘‘All exc. self’? destination 
shorthands, “remote read” delivery mode should 
not be used. 


5. For “all incl self’ and “self? destination short- 
hands “Reset” delivery mode should not be 
used. 
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ICR (INTERRUPT COMMAND REGISTER) 
(Continued) 
6. For ‘all exc self’? destination shorthand if “Re- 
set” delivery mode is used, it should be ensured 


at system level that only one processor executes 
this instruction at any time. 


7. Messages could be sent out in “Logical” or 
“Physical” mode with destination ID of all 1’s de- 
pending on the way Destination mode entry is 
programmed. In brief, ‘‘All incl self” and “All exc. 
Self’? support both ‘“‘Logical’” and “Physical” ad- 
dressing mode. 


8. When destination shorthand (i.e., broadcast) is 
used with lowest priority destination mode, then 

~ even though all participates in arbitrating for des- 
tination, only the lowest priority gets the mes- 
sage. So even though the addressing is broad- 
cast since the destination mode is lowest priority 
only one gets the message. 


9. When destination shorthand (i.e., broadcast) is 
used with ‘‘Fixed”’ destination mode, then all the 
units get the message. 


ISR/IRR/TMR 


Bits 0-15 of IRR/ISR/TMR do not track interrupt. 
No interrupt of vector numbers from 0-15 can be 
posted. The total interrupt supported are 240. When 
reading the lowest 32 bits of these registers, 0 will 
be returned for the lower 16 bits. 


FOCUS PROCESSOR 


Focus processor is applicable only within the ad- 
dressed units. 


ExtINT Interrupt Posting 


The external interrupt has no priority relationship 
with the 82489DxX priority. But when posting an inter- 
rupt to the processor, if both an external interrupt 
and a 82489DX interrupt are pending, 82489DX 
could post either one to the processor. In 82489DX 
implementation, it would post external interrupt 
whenever there is no other 82489Dx interrupt that 
can be posted to the processor. It should be also 
noted that External interrupts can not be masked by 
raising task priority. However, they can be masked 
by the mask bit in the table entry for the (“ExtINTA”’) 
external interrupt. 


The extINT interrupts are specific in their character- 
istics in that they do not have any priority relation- 
ship with the rest of the interrupt structure. ISR and 
IRR bits in 82489DX are used to do the housekeep- 
ing functions for interrupt priority. Since extINT inter- 
rupts do not have any priority relationship, ISR and 


: ADVANCE INFORMATION 


—_— a — 2 eh 4 ~ Yo We .. 6 tx 
em ere Lae SS oe ° ie Shee Uae ee Pe. A Sas wey Sry a ea oe Ee 


82489DX 


IRR bits are not maintained for external interrupts. 
As far as interrupt acceptance is concerned, if more 
than one extiINT interrupts are directed towards a 
local unit, that local unit treats all the extINT inter- 
rupts directed to it as only one extiNT interrupt. This 
leads to an important point that in a system not more 
than one interrupt should be programmed as extINT 
interrupt type with the same destination. It should be 
noted that there can be more than one extINT type 
of interrupt in a system with each having different 
local unit as destination. 


Synchronizing Arb IDs 


Initialization of an 82489DX’s local unit ID is imple- 
mentation dependent. In some platforms, power-on 
reset will latch the right values into the 82489DxXs; in 
other platforms, unique IDs may be assigned by ini- 
tialization firmware. In both cases the 82489DX I/O 
unit should be assigned unique ID by initialization 
firmware. The important point is that the 82489DXs 
are required to have unique IDs before they can use | 
the bus, and in addition, all their Arb IDs must be ‘“‘in 
sync’. Synchronizing Arb. IDs is accomplished as a 
side effect of a ‘“deassert reset” interrupt command. 
This resets the (rotating) Arb ID to the (constant) unit 
ID; it assumes that all 82489DXs have their unique 
ID. 


LOWEST PRIORITY 


“Only once delivery” semantics for a group destina- 
tion is guaranteed only if multiple fixed delivery of 
the same interrupt vector are not mixed. For lowest 
priority arbitration to work, all the arbitration ID of 
local 82489DXs in the system should be in sync. 
This means after local unit IDs are written in all local 
units (each ID should be different from other IDs) a 
RESET DEASSERT message should be sent in ALL 
INCLUSIVE mode. The RESET DEASSERT mes- 
sage should be sent before system is used for low- 
est priority arbitration. This ensures that all ARB IDs 
are also different. (Arb IDs are copied from local unit 
IDs during RESET DEASSERT message.) 


The RESET DEASSERT message, if not sent, only 
one delivery semantics may not be guaranteed in 
the cases where lowest arbitration is used in the sys- 
tem. 


DISABLING LOCAL UNIT 


Once the 82489DX is enabled by setting bit 8 of 
spurious vector register to 1, the user should not 
disable the local unit by resetting the bit to 0. The 
result will put the local unit in an inconsistant state. 
The local unit can be disabled by sending “reset’’ 
interrupt message to the local unit. 
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ISSUING EOI 


EOI, End of Interrupt issuing indicates end of service 
routine to 82489DX. The ISR bit which is set during 
INTA cycle gets cleared by EOI. This section dis- 
cusses the relevence of EOI to the specific types of 
interrupts and its timing related to interrupt deasser- 
tion. 


EXTERNAL INTERRUPTS AND EOI 


External Interrupts should be programmed as edge 
type. INTA cycles to external interrupts are taken 
automatically as EO! by 82489DxX. This is similar to 
AEOI, Automatic End of Interrupt of 8259A. So there 
is no need to issue EOI to 82489DX for external 
interrupt servicing. This is done to achieve software 
transparency in the compatible mode. 


SPURIOUS INTERRUPTS AND EOI 


Spurious interrupts do not have any priority relation- 
ship to other interrupts in the system. So IRR is not 
set for spurious interrupts. EOI should not be issued 
for spurious interrupts. It is advisable not to share 
the spurious interrupt with any vector. If spurious in- 
terrupt vector is shared with some other interrupt 
then while servicing issuing EO! depends on the 
source of interrupt. If the source is spurious interrupt 
. (for which the corresponding IRR is not set) then 
EOI should not be used. If the source is a valid inter- 
rupt sharing the spurious interrupt vector (for which 
the IRR is set) then EOI should be issued. 


NM AND EOI 


For NM type of interrupt no IRR bit is set. So EOI 
should not be issued while servicing NMI type of in- 
terrupts. 


TASK PRIORITY REGISTER 


Task priority register is used to specify the priority of 
the task the processor is executing. In 8259 the 
priority is defined only among the interrupts that it 
handles. 82489DX goes farther ahead in handling 
priority. In multitasking system, in addition to device 
interrupts various tasks have different priority and 
82489DX allows consideration of the priority at sys- 
tem level. The processor specifies the priority of the 
task it executes by writing to task priority register. 
Now any interrupts at and below the task priority will 
be masked temporarily till the task priority gets low- 
ered. The masking granularity is at priority level. Out 
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of 256 interrupt vectors 16 priority levels are speci- 


fied and 16 vectors share one priority level. Since 
the masking granularity by the task priority register is 
at priority level, group of 16 vectors get masked 
when task priority register is increased by one level. 


When task priority is at its minimum level of 0, inter- 
rupt vectors having level 1 to 16 are passed to CPU. 
Stated in other words, even when the task priority 
register is at its minimum (of level 0), interrupt vec- 
tors at level 0 will be masked. This means that the 
interrupt should not be programmed with vectors 0 
to 15. So out of 256 interrupt vectors, only 240 inter- 
rupt vectors (vector 16 to 255) can be used in 
82489DxX. 


ExtINT INTERRUPT AND TASK PRIORITY 


ExtINT interrupt does not have any priority relation- 
ship with other interrupts or task priority register. So 
ExtINT interrupt can not be masked by raising task 
priority. They can be masked by writing to the vector 
table entry which corresponds to ExtINT interrupt. 


REMOVING MASKS 


‘When enabling units and removing Mask bits in situ- 


ations where a device may already be injecting inter- 
rupts into the 82489DX system, the Mask in the Re- 
direction Table should be removed last to ensure 
proper initial state (e.g., Remote IRR bit matching 
IRR in local unit). 


DELIVERY MODE AND TRIGGER MODE 
It is software’s responsibility to make sure that Deliv- 


ery Mode and Trigger Mode are set to meaningful 
combinations as listed below. 


Delivery Mode Trigger Mode 
Fixed edge/level 
Lowest Priority edge/level 
Remote Read edge 

NMI level 

Reset level 

ExtINT edge 


Software is also responsible for not using meaning- 
less Delivery Modes in Redirection Table entries and 
local Vector Table entries (e.g., use of Remote Read 
delivery mode). 
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ASSIGNING INTERRUPT VECTORS 


Software has total control over the assignment of 
interrupt vectors to interrupt sources. The operating 
system writer should be aware of a number of things 
when doing this assignment. 


Some processor architectures assign a predefined 
meaning to some of the vectors (i.e., entries in the 
interrupt table) as entry points to certain trap and 
exception handlers (e.g., divide error, invalid op- 
code, page fault, etc.). The programmer is strongly 
advised not to reuse these vectors. 


The programmer must also be careful when using 
the same vector number to represent different inter- 
rupt sources (sharing vectors). This is especially true 
for level triggered interrupts. When multiple sources 
with different Redirection table entries share an in- 
terrupt vector, any of the sources deactivating its 
level signal will remove the interrupt request for all 
sources. Giving each interrupt source its interrupt 
vector in any case is the preferred approach. 


SENDING INTER-PROCESSOR INTERRUPTS 


Each 82489DX has a single Interrupt Command 
Register that it uses to send interrupts to other proc- 
essors. It is software’s responsibility to synchronize 
access to this register. Specifically, 1) writing all 
fields of the register, 2) sending the interrupt mes- 
sage (by writing the low register), and 3) waiting for 
Delivery State to become Idle again, should occur as 
a single atomic operation. For example, if interrupt 
handlers are allowed to send inter-processor inter- 
rupts, then interrupt dispensing to the processor 
must be disabled for the duration of these activities. 


DELAY WITH LEVEL TRIGGERED INTERRUPTS 


When a level triggered interrupt source deasserts its 
interrupt input, the destination will clear the inter- 
rupt’s IRR bit only after receiving the message from 
the ICC bus. This introduces a small delay between 
the removal of the interrupt at the source and the 
removal of the interrupt at the processor. To avoid 
generating unnecessary interrupts, the interrupt han- 
dler should remove the interrupt at the source (at the 
device) as early as possible in the handler. 


In any case, handlers should be able to deal with 
unnecessary interrupts. 
RESET DEASSERT 


A side effect of a reset deassertion message broad- 
cast in the ICC bus is that all 82489DX local units 
reset their Arb ID to their unit ID. Interrupt com- 
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mands that use the “‘self’’? Destination Shorthand do 
not generate a message on the ICC bus. If software 
only wants to generate the side effect of resetting 
Arb IDs, it should use a command with Logical Desti- 
nation Mode and a Destination field containing all 

zeroes. , 


INTERRUPT MASKING 


There are a number of levels at which interrupts can — 
be masked, each resulting in a different behavior on 
interrupt delivery. 


e First, interrupt injection (or deliver) can be 
masked by setting the Mask bit in the interrupt’s 
Redirection Table or local Vector Table entry. 
These interrupts are ignored, no message is sent 
for them. Granularity is an individual interrupt. 


e Second, each 82489DX can individually mask in- 
terrupt dispensing by raising its Task Priority to 
some level. This 82489DX will not dispense inter- 
rupts to its processor of this and lower priority 
unless it is currently the focus of the interrupt. 
Note again that the 82489DxX is designed to oper- 
ate as fully nested with non-specific EOI (to use 
existing 8259A terminology). There is no explicit 
interrupt mask (such as MR) and there is no no- 
tion of specific EOI. 


e Third, each processor may provide a mechanism 
that masks all interrupt dispensing to it using the 
processor supplied instructions or status bits to 
do so. This does not interfere with lowest priority 
arbitration of the processor’s 82489DxX local unit. 


CHANGING REDIRECTION TABLES 


Redirection Tables are typically set up at initializa- 
tion time. When modifying a Redirection Table entry 
“on the fly” the programmer must be aware of state 
kept at other 82489DxXs relative to the interrupt be- 
ing modified. 


DEVICE DRIVERS WITH 82489DX 


It is strongly recommended to read the device status 
registers before servicing the device. This is be- 
cause if an edge triggered device deasserts its inter- 
rupt before interrupt acknowledge cycle (it should 
NOT) 82489DxX will NOT give spurious vector. It will 
give genuine interrupt vector corresponding to the 
device. So, interrupt service routine should validate 
the interrupt request before servicing the device. 
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SYSTEM HARDWARE AND 
SOFTWARE DESIGN 
CONSIDERATIONS 


Design Consideration 1 


Description: The following design consideration has 
to be taken care of when using ISP (82357) as exter- 
nal interrupt controller. 82489DX allows connecting 
external 8259 type interrupt controller at one of its 
inputs. The mode associated with the interrupt input 
which has 8259 connected to it is called ExtINTA 
mode. 82489DX allows only EDGE TRIGGERED 
programming option for ExtINTA mode. But in the 
case of 82357, the INT output from ISP stays high in 
case more than one interrupt is pending at its inputs. 
It does not always inactivate its INT output after 
INTA cycle. This will lead to a situation where ISP 
keeps the interrupt at high level continously and 
waits for INTA cycle. But since 82489DX expects an 
edge for interrupt sensing (for ExtINTA interrupts) it 
does not pass the interrupt to CPU and further inter- 
rupts are lost. So External circuitry should monitor 
the end of SECOND CYCLE of INTA cycle and force 
an inactive state at 82489DX’s input. To avoid glitch- 
es at 82489DxX input, this external logic should clear 


its output only at the end of second INTA cycle. It © 


should be set by high going 82357 output. It should 
never be cleared by low going 82357 output. That is 
it should not follow 82357 output. 


Design Consideration 2 


Description: The following design consideration has 
to be taken care of when using 82489DX in EISA 
systems. EISA ISP(82357) chip integrates 8259A. It 
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additionally allows sharing of interrupts. To facilitate 
this sharing it has a programmable register, ELCR 
(Edge / Level trigger control register) by which cer- 
tain interrupt inputs can be programmed as edge 
(low to high except for RTC) or level (the level is 
active low). The determination of edge or level is © 
done during initial configuration of EISA system by 
reading EISA add in boards from the interrupt de- 
scription data structures. The solution is to have pro- 
grammable logic at the interrupt inputs so that 
82489DxX is compatible with EISA ISP. This will intro- 
duce one more register and logic to support this. 
This should be an 11-bit programmable register and 
an array of ExOR logic (12 ExOR gates or equivalent 
PLD). The ISP allows programmability of the follow- 
ing interrupts. 


INT3 INT4 INT5 INT6 INT7 INT9 INT10 INT11 
INT12 INT14 INT15. In addition to the above 11 in- 
terrupts, it fixes INT8 to be active low edge triggered 
interrupt. INT8 is the only case where it is active low 
edge triggered type. So the following logic can be 
used to add programmability in 82489DX based 
EISA system. Before connecting these 11 interrupt 
lines directly (#INT8 which is from Real Time Clock 
is always active low edge triggered. #INT8 can be 
passed through an inverter since there is no need 
for programmability) to the 82489DX they should 
pass through an array of 11 Ex__OR gates. One in- 
put of Ex__OR gate connects to the corresponding 
INT pin and other input connects to a bit of program- 
mable register. The output of Ex__OR gate is con- 
nected to 82489DxX. The idea of Ex__OR is to use as 
a controlled inverter. 


INTINS INTIN4 
INTINS INTING 
INTIN7 INTINS 
INTIN10 


INTIN 14 


INTIN12 


INTIN1S 
eis Lo 


INTIN14 
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INTIN are the interrupt inputs to the 82489DX and 
INT are the system interrupt. The Ex__OR gating 
register is programmed after EISA configuration is 
found from add in boards as how these interrupt 
lines are going to be used in that particular configu- 
ration. If a particular input is edge triggered, then the 
corresponding bit in the register is written with 0. If a 
particular input is level triggered, then the corre- 
sponding bit in the register is written with 1. 


8259 by itself does not have polarity control where- 
as 8259 when implemented in EISA chipsets have 
the polarity control. Similarly APIC does not have by 
itself polarity register. So polarity register should be 
programmed as a part of system BIOS and not APIC 
BIOS. 


Design Consideration 3 


Icc bus drive is an open drain bus with drive capacity 
of 4 mA only. Since data is transmitted at each Icc 
clock, the “charging” of Icc bus should be fast 
enough to ensure proper logic level at each clock 
edge. The Icc bus needs pull up resistors since it is 
open drain bus. Since the drive is only 4 mA, the pull 
up resistor value can not be less than 5V/4 mA. This 
being the limit of the resistor value, the length and 
the characteristics of the Icoc trace forces a capaci- 
tance value. Both the resistor and capacitance 
brings a RC time constant to the Icc bus waveform. 
So, Electrical consideration has to be given to and 
practice of controlled impedence should be exer- 
cised for layout of the Icc bus. The length of the 
trace should be kept as minimum as possible. If the 
length of the Icc bus can’t be kept less, than say 6 
inch, because of mechanical design of the system, 
the external line drivers should be added to Icc bus 
and Icc bus should be simulated with the added driv- 
er characteristics. 


74(ABT/F)125 


Buffered bus 
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R1 can be typically 1K. 
R2 is designed from the simulation results. 
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Design Consideration 4 


This is related to ADS#, BGT# and CS# timings. 
For bus cycles not intended for 82489DX, (CS# = 1 
where 82489DX is supposed to sample it), any 
change in CS# line while the ADS# is still active, 
may erroneously cause a RDY# returned from 
82489DX. Anomolous behavior may result if for 
BGT # ties low cases 


a) BGT # goes away just one clock after ADS# or 


b) ADS# is still active, and CS# changes during 
this period. 


For other cases anomolous behavior results if CS # 
changes when ADS# is still active. The following 
considerations are important from timing point of 
view. Always limit the pulse width of 82489DX 
ADS # to one CLKIN. Also avoid changing levels on 
BGT #/CS# line, when ADS# is active for cases 
being identified as BGT # tied low (BGT # sampled 
low when ADS# goes active). Also avoid changing 
levels on CS# line when BGT # is active. 


Design Consideration 5 


82489DX does not recognize the interrupt when an 
edge occurs at the interrupt input pin while interrupt 
is masked. When later it is unmasked there is no 
further edge and so 82489DX never passes that for- 
gotten edge and that interrupt channel becomes un- 
usable after that. — 


The recommendation is that first 82489DX should 
be unmasked and then the device interrupt should 
be enabled in the device register. By this, software 
can ensure that always an edge will occur after an 
interrupt is unmasked. 


Design Consideration 6 


Description: Edge triggered interrupts should not 
deassert their output till they are acknowledged by 
INTA cycle from CPU. 


issue: 


82489DX employs glitch detection logic for edge 
triggered logic. To make sure the detected edge in- 
terrupt is not a glitch, 82489DX samples the input 
again before sending the interrupt message. The 
time difference between the first sampling of inter- 
rupt to be active and second sampling (just before 
sending the interrupt) is not a constant number. This 
is because the ICC bus might have been occupied 
by other messages. So, for example if during first 
sampling it was detected that INTINO and INTIN15 
are both active and after sending INTINO it samples 
INTIN15 again before sending message for 
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INTIN15. But between this time ICC bus might have 
been occupied by other messages: So even if an 
edge triggered interrupt is held active high for a real- 
ly long time and then brought low before INTA cycle, 
it is considered as a glitch. Because it may happen 
that the second sampling occurred just when the in- 
terrupt line got low. 


Once the glitch detection circuitry found this 
‘glitch’, it goes back to the state where it will start 
sampling and waiting for an active edge to occur. 
This takes more than one clock cycle (CLK) and if 
the “glitch interrupt” generates an edge before that 
time after the second sampling of low level is done, 
then the edge is lost forever. 


BCLK is the standard EISA BCLK. 
CYCLE # is an external signal to indicate ADS# is asserted earlier and RDY low is not yet returned. 
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Since the time when the second sampling is done is 
unknown, the best way is to make sure the edge 
triggered interrupts do not deassert their outputs till 
they are acknowledged by INTA cycle from CPU. It 
is found that in some cases 8259 can generate brief 
active low pulses on its output. So the glue logic 
between 8259 and 82489DX input pin should make 
sure that 82489DX input pin is clear only after get- 
ting second interrupt acknowledge cycle. The glue 
logic should not just follow the 8259 output. Put in 
other words, after interrupt acknowledge cycle to 
8259, if the 8259 input is seen active high, it should 
generate an edge at 82489DxX input. Moreover, even 
if 8259 output goes low the glue logic should not 
lower its output since the only time when the glue 
logic can deassert its output is when it finds an inter- 
rupt acknowledge cycle for 8259. The following PLD 
equations and schematics serves as an example for 
the glue logic between 8259 and 82489DxX. 


INTAcet 
8259Cyc 

set 8259Cyc 
setTimer 

TO 

T1 


APIC Input 
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APIC input = 


+/T0* /Reset * APICinput * INTA cor 


+/INTA got * 8259INT 


Set 8259Cyc = /Reset* INTA* ExtiINTA 


8259Cyc 


INTA Sot 


T5 := T4 * /Reset 


NOTES: 


DIRECTIONS FOR EASY MIGRATION 
TO FUTURE INTEGRATED APIC 


The following are the software programming direc- 
tions Intel strongly recommends for easy migration 
from 82489DxX to integrated APIC. The audience to 
this portion of the document are both hardware de- 
signers and firmware developers for APIC based 
systems. In the following discussions, the APIC 
BIOS is viewed functionally as two subsections 1) 
APIC BIOS which are all interrupt vector, priority, in- 
terrupt distribution related functions and the remain- 
ing portion of BIOS which is referred to part of sys- 
tem BIOS which is responsible for interrupt polarity 
programming, starting next processor, etc. 


} 
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/TO */Reset* /APIC input * 8259INT * INTA cg ; Sample 8259 interrupt 


; This INTA cycle is for 8259 


= set8259Cyc */T0* /Reset ; Set 8259cyc will set 8259 cycle and TO will clear it 


+ /T0 * /Set8259Cyc * 8259Cyc * /Reset ; Hold 8259 cycle till TO clears it 


= /Reset * /INTA,,, * INTA ; wait for'very first INTA cycle after reset 


+ /Reset * INTA,,, ; once first INTA cycle after Reset is found, set the INTA,,; 


Set timer =8259cyc*/A2*/RDYlow _ ; Start the timer at end of second INTA cycle 


TO = Set timer* /T5 * /Reset : Set timer will set TO and T5 will clear 


+ /Set timer * TO * /TS * /Reset ; Till TS clears it hold TO 
T1 := TO * /Reset* /TS ; Follow TO after one clock for setting but clear along with TO 


T2 := T1* /Reset* /T5_ ; Follow TO after two clock for setting but clear along with TO 
T3 := T2 * /Reset*/TS_ ; Follow TO after three clock for setting but clear along with TO 
T4 := T3* /Reset* /T5 ; Follow TO after four clock for setting but clear along with TO 


; Follow TO after 5 clock for setting 


T1, T2, T3, T4 and T5 are clocked signals and others are combinatorials. 

This circuit and PLD equations are given for concept clarification purpose. They are not tested. 
INTAset is needed so that some 8259 logic at power on activates its INT output to 1 and it deactivates its output after 
only 8259 initialization (which should happen after APIC initialization) and since APIC needs to detect rising edge at 
8259, it is essential to follow the 8259 until first interrupt. This is the only occasion 8259 output will be just followed. 


82489DX 


; Hold till it is cleared by delayed interrupt acknowledge 
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Note that the names APIC BIOS and APIC DRIVER 
are interchangeably used in the following discussion. 
Different Operating systems refer such functional 
module differently. 


Consideration 1 


Question: The logical destination register in future 
implemented APIC may have only 8 MSBs defined 
and 82489DX has 32 bits specified. Will this. hinder 
binary level compatibility? 
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Response: In logical destination (flat addressing 
mode) 82489DX can go up to 32 CPUs whereas fu- 
ture APIC can go up to 8 CPUs with flat logical ad- 
dressing mode. For binary compatibility, it is 
strongly recommended that 82489DX software 
use ONLY 8 MSB of logical destination register. 


Consideration 2 


Question: The present day MP systems with exter- 
nal control ports for starting next processor may pro- 
gram those external control ports for starting next 
processor. APIC DRIVER may use external control 
ports for starting next processor. In future implemen- 
tations of APIC, the starting of next processors may 
use more refined mechanisms which may not use 
external control ports. Will this introduce compatibil- 
ity problem? 


Response: Again, the starting of next processor is 
really part of MP system DRIVER and depending of 
the mechanism used to start next processor it will 
vary. In future implementation if starting next proces- 
sor is done using new mechanisms, the starting next 
processor portion of MP DRIVER will be changed 
accordingly. Even though this will not result in any 
change in the APIC DRIVER which deals with inter- 
rupt priority, distribution, etc., the corresponding 
change will be needed in the starting application 
processors portion of DRIVER. 


One possible method of implementing software is 
using version register. Version register is different in 
489DX and future implementations of APIC. Taking 
care of these differences, such as mechanism for 
starting next processor, should be possible using 
version register. — 


Consideration 3 


Question: APIC architecture, by its nature, seems to 
misinterpret spurious interrupts as genuine inter- 
rupts. That is, if an edge triggered interrupt goes in- 
active before interrupt acknowledge cycle, APIC, in- 
stead of giving spurious interrupt vector, gives genu- 
ine interrupt vector. Is it true that this is not the case 
with 8259? If that is the case, drivers which do not 
check device status registers for servicing the de- 
vice may work with 8259 but may not work with 
APIC. Is this a compatibility problem? 


Response: No, this is not true. Even with 8259 there 
is a time window in which a similar thing can happen. 
For example if interrupt goes inactive just after first 
INTA cycle but before second INTA cycle 8259 will 
also signal this spurious interrupt as genuine inter- 
rupt. So drivers which do not check device Ae 
registers may also fail with 8259. 
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Our strong recommendation to device drivers is to 
read device status register before servicing the de- 
vice. If the device status register indicates that there 
is no valid source of the interrupt, the service routine 
should just issue EO! and return. It should not serv- 
ice the device. This should take care of the new driv- 
ers that will be written for APIC. To coexist with 
8259, the APIC interrupt input connected to 8259 will 
be programmed for virtual wire mode. In virtual wire 
mode, the time window of 8259 will apply. So the 
driver will behave same way as it was behaving with 
8259. 


Consideration 4 


Question: EISA system has active low level polarity. 
82489DxX itself does not have polarity control regis- 
ter to support this EISA feature. Implementations us- 
ing external polarity register may implement the po- 
larity register at different address. Will this introduce 
a problem for achieving the goal of single binary? 


Response: 8259 by itself does not have polarity 
control whereas 8259 when implemented in EISA 
chipset has the polarity control. Similarly APIC does 
not have by itself polarity register. When implement- 
ed in ESC chipset, it will have polarity control regis- 
ter. So polarity register should be programmed as a 
part of EISA BIOS and not APIC BIOS. Since system 
BIOS or EISA BIOS should be able to take charge of 
changes, if any, to polarity control register. APIC 
BIOS should not be affected by differences in the 
address for polarity register. 


Consideration 5 


Question: 8259 recognizes the interrupt when an 
edge occurs at the interrupt input pin even if the 
interrupt was masked. So when the interrupt input is 
later unmasked, the interrupt is posted to the CPU. 
82489DxX does not register this edge and if interrupt 
happens when the interrupt is masked 82489DxX just 
ignores the interrupt. When later it is unmasked 
there is no further edge and so 82489DX never 
passes that forgotten edge and that interrupt chan- 
nel becomes unusable after that. 


Response: When the interrupt is masked, logically 
interrupt controller should ignore whatever happens 
there. It is strongly recommended that first 82489DX 
should be unmasked and then the device interrupt 
should be enabled. By this sequence, software can 
ensure that always an edge will occur at the APIC 
input only after the interrupt is unmasked. 


Please contact Intel for platform level specification 
in Multiprocessor system design using APIC. 


ADVANCE INFORMATION 


Application Notes 
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intel. 
INTRODUCTION 


82489DX is the new interrupt controller for high per- 
formance systems and 32-bit OS. Some important con- 
siderations for hardware designers are given. This ap- 
plication note will provide information of all registers 
in 82489DX and their bits and bytes organization. The 
control word for various programming options are giv- 
en in a tabular format. Some programming hints are 
given to facilitate a quick understanding of the inter- 
rupt architecture and the priority model in 82489DX. 


The programming model discusses the registers, their 
data structure like fields, bits, bytes and default register 
values. The system considerations and key points to be 
noted while programming 82489DX are discussed next. 
Typical examples of initialization, interrupt service rou- 
tine and Spl() routines are given. The notes discuss im- 
portant hardware design considerations. 


Related Reference Materials 
1) 82489DX Data Book, Order Number 290446. 


2) An APIC based Symmetric Multiprocessor System 
Design AP-474, Order Number 241521. 


REGISTER ORGANIZATION 


The 82489DX contains both the local unit and I/O 
unit. I/O unit has its own Unit ID and local unit has its 
own Unit ID. Both units are operational at all times 
once they are enabled and the access can be done to 
both units. It should be noted that the local unit has its 
own version register, and I/O unit has its own version 
register, namely, I/O version register. The unit enable 
bit is provided for local unit and it is not provided for 
I/O unit. However, I/O unit has mask bit for each 
redirection table entry to mask the interrupts. Func- 
tionally I/O unit can only transmit interrupt messages 
whereas local unit can both transmit and receive inter- 
rupt messages. In summary, 82489DX should be 
viewed as an integrated chip having a local unit and an 
I/O unit both capable of operating at the same time. 


INITIAL REGISTER VALUES AFTER 
HARDWARE RESET 


The local unit ID register latches the value on the ad- 
dress pins A3 to A10 after hardware reset whereas the 
I/O unit ID register gets cleared to 0 after hardware 
reset. The local unit Version Register is cleared to 0 
whereas the I/O unit Version Register contains 1111 in 
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its Max Redir Entry field. The interrupt masks in the 
local timer vector table register and in the I/O redirec- 
tion table entry(31:0) registers are set so that after reset 
all the interrupts are masked. The spurious vector regis- 
ter’s unit enable bit is cleared so that local unit is dis- 
abled after hardware reset. Since all the interrupts are 
masked after hardware reset, the I/O unit will not 
transmit any interrupt after hardware reset until mask 
is cleared specifically by software and the interrupt is 
active. 


All other registers are cleared to 0 after hardware reset. 


SYSTEM CONSIDERATIONS WHILE 
PROGRAMMING THE 82489DX 


The 82489DX register data structure contains different 
fields to specify the mode of operations and the options 
available within each mode. Since certain options are 
applicable to specific modes only (for example ‘““Remote 
Read” mode applies only to Interrupt Command Regis- 
ter, it does not have any relevance to I/O unit’s redirec- 
tion tables) the following programming hints are pro- 
vided. 


82489DX and Memory Mapping 


The 82489DX is a 32-bit high performance interrupt 
controller. It’ allows the CPU to do 32-bit read and 
write to it. By memory mapping the 82489DX, system 
performance can be enhanced. Even though the 
82489DX can be memory mapped, its functionality as 
an interrupt controller should be kept in mind while 
programming the virtual memory management control 
data structure. The caching policy for the page where 
an 82489DX is mapped should also be done with the 
functionality of the 82489DX in mind. For example, 
the reads to an 82489DX should not be cached and 
writes should be write-through. Since 82489DX regis- 
ters are aligned at 128-bit boundaries, memory map- 
ping the 82489DX with interleaved memory system 
should not be a problem. However, it should be noted 
that the 82489DX does not support pipelining. 


Unique ID Requirement 


All the local units and I/O units hooked on an Icc¢ bus 
should have an unique ID before they can use the bus. 
This should be ensured by the programmer, since for 
Icc bus arbitration the units (whether it is local unit or 
I/O unit) arbitrate with their unit ID. 
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PROGRAMMING THE LOCAL UNIT 


Dos and Don’ts 

1. The local interrupt vector table entry (and the I/O 
unit redirection table entry) should not be pro- 
grammed for “Remote Read” Delivery mode. In 
other words, only Interrupt Command register sup- 
ports “Remote Read” Delivery mode. 


2. Local Interrupts should not be programmed with 
“Lowest Priority” Delivery mode. 


3. Local Interrupts should not be programmed with 
“Reset” Delivery mode. 


4. It is not recommended to use level triggered mode 
except for ‘““Reset Deassert’’ messages. 


Atomic Write Read to Task Priority 
Register 


This section discusses issues regarding write buffer 
flushing and necessity of atomicity of task priority reg- 
ister programming. 


Typically, the task priority register is written with high- 
er priority to mask certain low level interrupts before 
entering into a critical section code. In a system where 
an 82489DX is memory mapped the CPU may buffer 
this task priority register write to its on chip write buff- 
er. The following scenario can happen in such situation: 
CPU posts task priority register write to its on chip 
write buffer and enters into the critical code. A lower 
priority interrupt (which should not enter the critical 
code) interrupts the CPU before the write buffer gets 
flushed into task priority register). The CPU now erra- 
neously accepts the lower priority interrupt. To avoid 
the situation, atomic write and read to task priority 
register should be done. The read following write en- 
sures that the write buffer is flushed to task priority 
register and the atomicity ensures that no interrupt will 
be accepted by the CPU during its write to task priori- 
ty. In case if the CPU itself takes care of flushing its 
write buffers before INTA cycle, there is no problem. 
However, if there are system posted write buffers then 
external logic should make sure to flush the system 
write buffers before INTA-cycle. 


Task Priority Register and Total Usable 
Vectors 


Task priority register is used to specify the priority of 
the task the processor is executing. In 8259 the priority 
is defined only among the interrupts that it handles. 
82489DX goes further ahead in handling priority. In 
multitasking system, in addition to device interrupts, 
various tasks have different priority and 82489DX al- 
lows consideration of the priority at system level. The 
processor specifies the priority of the task it executes by 
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writing to task priority register. Now any interrupts at 
or below the task priority will be masked until the task 
priority gets lowered. The masking granularity is at pri- 
ority level. Out of 256 interrupt vectors 16 priority lev- 
els are specified and 16 vectors share one priority level. 
Since the masking granularity by the task priority regis- 
ter is at priority level, group of 16 vectors get masked 
when a local unit increases its task priority by one level. 


When task priority register is at its minimum level of 0, 
interrupt vectors having level 1 to 16 are passed to 
CPU. Stated in other words, even when the task priori- 
ty register is at its minimum (level of 0), interrupt vec- 
tors at level 0 will be masked. This means that the in- 
terrupts should not be programmed with vectors 0 to 
15. So out of 256 interrupt vectors, only 240 interrupt 
vectors (vector 16 to 255) can be used in 82489DX. 


ISR/IRR/TMR 


1. Bits O-15 of IRR/ISR/TMR do not track inter- 
rupt. No interrupt of vector number from 0-15 can 
be posted. The total interrupts supported are 240. 
This can be easily explained by the way the priority 
mechanism is defined. When reading the lowest 32 
bits of this register, 0 will always be returned for the 
lower 16 bits. 


Interrupt Command Register 
Programming Considerations 


The interrupt command register (31:0) has the side ef- 
fect of sending interrupt once it is written. There is no 
mask bit associated with Interrupt Command Register. 
Once interrupt command register (31:0) is written, the 
interrupt is sent from the local unit. The interrupt des- 
tination is provided in the interrupt command register 
(63:32). So, the interrupt command register (63:32) 
should always be programmed before the interrupt 
command register (31:0) is programmed. 


Disable interrupt 
Program interrupt Command Register(63:32) 


Program interrupt Command Register(31:0) 


Enable interrupt 
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CRITICAL REGIONS AND MUTUAL EXCLUSION 


This section discusses the reasons for mutual exclusion 
to be exercised when writing to interrupt command reg- 
ister. Each 82489DX has a single Interrupt Command 
Register that is used to send interrupts to other proces- 
sors. The programmer should make sure to synchronize 
access to this register. Specifically, (1) writing all fields 
of the register (MSB), (2) Sending the interrupt mes- 
sage (by writing the LSB register), and (3) waiting for 
Delivery State to become Idle again, should occur as a 
single atomic operation. For example, if interrupt han- 
dlers are also allowed to send inter processor interrupts, 
then interrupt dispensing to the processor must be dis- 
abled for the duration of these activities so that inter- 
rupt handlers are excluded from accessing the ICR. 
This is explained as follows. Let us assume in a typical 
MP system preemptive scheduling (on another proces- 
sor) is implemented by sending inter processor inter- 
rupts (IPIs). IPI can be also used for clock distribution 
in an asymmetric system where the timer interrupts 
only one processor and that processor notifies all other 
processors in the system through IPI. Inter processor 
interrupts are implemented by using interrupt com- 
mand register. If we allow interrupts during writing 
interrupt command register the following erraneous op- 
eration may result. If interrupts are enabled (they 
should not be) during writing to interrupt command 
register, interrupts can come after writing to MSB por- 
tion and before writing to LSB portion of interrupt 
command register. Now in the interrupt service rou- 
tine, if ICR is used (for distribution of interrupt to oth- 
er processor(s), for example) then this ISR also starts 
writing to the Interrupt Command Register. That 
means the ISR will overwrite the MSB portion just 
written by the previous IPI. After returning from the 
ISR when the previous IPI continous writing to the 
remaining LSB portion, the message will be delivered to 
wrong address since MSB is modified by the module 
which interrupted. The inference is that while access- 
ing ICR interrupts should be disabled. Also it should be 
noted that except for “‘Reassert Deassert Messages’’, 
IPI should only use edge triggered mode. 


BUFFERING IN INTERRUPT COMMAND 
REGISTER 


The Interrupt Command Register provides one level of 
buffering which should be kept in mind while program- 
ming an 82489DX. The ICR (Interrupt Command 
Register) becomes busy as soon as inter processor mes- 
sage is written into it. It hands the message over to Icc¢ 
bus transmit unit which in turn tries to send through 
Icc bus. Since the ICR has passed the command to 
transmit Unit (whose responsibility is to send it 
through Icc bus) it becomes free. The software before 
writing next inter processor message reads the flag to be 
free and writes next message. Thus there is a possibility 
of next message being written into the 82489DX before 
the first message is really sent out. The programmer 
should be aware of this. 
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INTERRUPT COMMAND REGISTER DOS AND 
DON’TS 


1. “ExtINTA” delivery mode should not be used for 
all destination shorthand. 


2. “Remote Read’ should always be programmed as 
“Edge” triggered interrupt. 


3. “Remote Read” should always be programmed with 
physical Destination mode (and not with Logical 
Destination mode). 


3. Only Fixed Delivery Mode should be used for ‘‘Self”’ 
destination shorthand. Stated otherwise, “lowest 
priority”, ““Remote Read”, “Reset’’, ““NMI”’ deliv- 
ery modes do not apply for “Self”. 


4. For “All incl. self’ and “‘All excl Self’? destination 
shorthands, “Remote Read” delivery mode should 
not be used. 


5. For “All incl. self’ and “Self? destination short- 
hands “Reset”? Assert mode should not be used. 


6. For “All exclusive self’? destination shorthand if 
“Reset ASSERT” delivery mode is used, it should 
be ensured at system level that only one processor 
executes this instruction at any time. To explain 
this, let us consider the following situation. Let us 
assume that two CPUs, CPU A and CPU B are 
executing “Reset ASSERT, All Exclusive Self’. The 
message of CPU A puts every CPU except CPU A 
in reset state. After the message is written by CPU 
A it typically takes 2.9 ys for the message to flow 


through the Icc bus to reach other local units to — 


reset all other processors. Before this message resets, 
let us assume another processor also, say CPU B, 
issues the ““Reset ASSERT, All Exclusive Self’ mes- 
sage. The following CPU B message (which was sent 
out before CPU B itself got reset because of CPU A 
reset message) will reset every CPU, which will in- 
clude CPU A, except CPU B. But CPU B will even- 
tually get reset by the message sent by CPU A and 
CPU A will also get reset by the message sent by 
CPU B. Thus all the CPUs in the system goes into 
reset state and this is an irrecoverable state. To 
avoid this, only one processor should execute this 
instruction at any time. This can be achieved, for 
example, by spinlock or mutex implemented as 
shared variables between multiprocessors. 


7. Messages could be sent out in “Logical” or ‘“‘Physi- 
cal’? mode with destination ID of all 1’s depending 
on the way Destination Mode entry is programmed. 
In brief, “All incl. self’ and “All excl. self?’ supports 
both “Logical” and ‘“‘Physical” addressing mode. 


8. When destination shorthand (i.e., broadcast) is used 
with “lowest priority” destination mode, then even 
though all participates in arbitrating for destination, 
only the lowest priority gets the message. So even 
though the addressing is broadcast since the destina- 
tion mode is lowest priority only one gets the mes- 
sage. 
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9. When destination shorthand (i.e., broadcast) is used 
with “Fixed” destination mode, then all the units 
get the message. So to send messages to all units 
Fixed destination mode should be used in addition 
to using destination shorthand. 


10. It is recommended that all IPI messages, except for 
“Reset Deassert’’, use only edge triggered interrupt 
mode. 


IP! THROUGH INTERRUPT COMMAND 
REGISTER 


Interrupt command register can be used to send inter 
processor interrupt. Inter processor interrupts can be 
used for preemptive scheduling, TLB flushing, clock 
distribution, etc. IPIs can also used in asymmetric sys- 
tems to pass certain work to another processor who has 
exclusive access to certain piece of hardware. Let us 
consider a dual processor system which uses only two 
82489DX in the whole system. Since the local units 
should be accessible only by their respective processor a 
local unit should be selected only when the arbitrator 
grants the bus to its processor. Since 82489DX has 
common chip select for its local unit and I/O unit, for 
logical simplicity, system hardware may select a 
82489DX when the corresponding processor is granted 
bus. Because of this, processor A can access only I/O 
unit and local unit that are available in its 82489DX. It 
is not possible to access the I/O unit of the other 
82489DX. The same thing holds good for the other 
processor. Since I/O unit should be globally visible to 
both processors, there may be situations when a proces- 
sor may want to access the other I/O unit. This is typi- 
cally the case for enabling and disabling the I/O inter- 
rupt. IPI can be used to pass that task to the other 
processor which can access that I/O unit. This is just 
one example for using IPI. 


ExtINTA INTERRUPT POSTING 


ExtINTA interrupts are used to support 8259 in a 
82489DX based system. The external interrupts (Ex- 
tINTA) are specific in their characteristics in that they 
do not have any priority relationship with rest of the 
interrupt structure. But when posting an interrupt to 
the processor, if both an external interrupt and a 
82489DX interrupt are pending, 82489DX could post 
either one to the processor. In 82489DX implementa- 
tion, it would post external interrupt whenever there is 
no other 82489DX interrupt that can be posted to the 
processor. It should be also noted that External Inter- 
rupts can not be masked by raising task priority. How- 
ever, they can be masked by the mask bit in the table 
entry for that (ExtINTA) interrupt. 


Since ExtINTA interrupts do not have any priority re- 
lationship, ISR and IRR bits are not maintained for 


3-6 


intel. 


external interrupts. As far as interrupt acceptance is 
concerned, if more than one ExtINTA interrupts are 
directed towards a local unit, that local unit treats all 
the ExtINTA interrupts directed to it as only one Ex- 
tINTA interrupt. This leads to an important point that 
in a system no more than one interrupt should be pro- 
grammed as ExtINTA interrupt type with the same 
destination. However, it should be noted that there can 
be more than one ExtINTA type of interrupt in a sys- 
tem with each having different local unit as destination. 


LOWEST PRIORITY 


Under the lowest priority delivery method, the proces- 
sor to handle the interrupt is the one in the specified 
destination with the lowest processor priority value. If 
more than one processor is at the lowest priority, then a 
unique arbitration ID is used to break ties. To have 
unique arbitration ID in the system (which is mandato- 
ry for the lowest priority algorithm to work) all the 
arbitration ID of local 82489DXs in the system should 
be in sync. On réset, arbitration ID is reset to zero by 
the hardware. Hence all the local units in the system 
after reset will have same arbitration ID (namely zero). 
For lowest priority arbitration to work properly we 
need to have unique arbitration ID in the system. This 
means after local unit IDs are written in all local units 
(obviously, each unit ID should be different from other 
IDs) a RESET DEASSERT message should be sent in 
ALL INCLUSIVE mode. The important side effect of 
RESET DEASSERT message is that it copies the 
unitID into the respective arbitration ID. Since unit 
IDs are unique, the RESET DEASSERT message en- 
sures that the arbitration ID also are unique in the sys- 
tem. This RESET DEASSERT message should be sent 
before system is used for lowest priority arbitration. 


The RESET DEASSERT message, if not sent, only 
once delivery semantics may not be guaranteed. If 
RESET DEASSERT message is not sent then all the 
arbID in the system will be same. When a message is 
sent in the lowest priority arbitration, the participating 
local units use their processor priority concatenated 
with arbitration ID to decide the destination. Processor 
priority is derived from the task priority. There is a 
chance that two local units can have same task priority 
depending on the code they are executing and thereby 
same processor priority. In addition since arbID are 
also same if RESET DEASSERT message WAS NOT 
sent, all the processors in the same priority may accept 
the message in lowest priority arbitration. This violates 
the only once delivery semantics. The inference is that 
RESET DEASSERT message in ALL INCLUSIVE 
SELF mode should be sent as part of initialization be- 
fore enabling interrupt in the lowest priority destination 
scheme. 
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It should be noted that only once delivery semantics for 


a group destination is guaranteed only if multiple fixed 
delivery of the same interrupt vector are not mixed. 


DISABLING LOCAL UNIT 


Once the 82489DX is enabled by setting bit 8 of spuri- 
ous vector register to 1, the user should not disable the 
local unit by resetting the bit to 0. The result will put 
the local unit in an inconsistent state. However, a local 
unit can be disabled by getting “reset” interrupt mes- 
sage from any other local unit across the Icc bus. 


ISSUING EOI 


EOI, End of Interrupt issuing indicates end of service 
routine to 82489DX. Always the highest priority ISR 
bit which is set during INTA cycle gets cleared by EOI. 
This section discusses the relevance of EOI to the spe- 
cific types of interrupts and its timing related to inter- 
rupt deassertion. 


EXTERNAL INTERRUPTS AND EOI 


External Interrupts (ExtINTA) should be programmed 
as edge type. INTA cycles to external interrupts are 
taken automatically as EOI by 82489DX. This is simi- 
lar to AEOI, Automatic End of Interrupt of 8259A. So 
EOI should not be issued to 82489DX for ExtINTA 
interrupt servicing. For ExtINTA type of interrupts, 
there is no need to have interrupt service routines since 
the main purpose of ExtINTA interrupt itself is to have 
software transparency in the compatible mode. The ex- 
isting interrupt service routines written for 8259 will be 
executed by the processor for ExtINTA interrupts. 


SPURIOUS INTERRUPTS AND EOI 


Spurious Interrupts do not have any priority relation- 
ship to other interrupts in the system. So IRR is not set 
for spurious interrupts. EOI should not be issued for 
spurious interrupts. It is advisable not to share the spu- 
rious interrupt vector with any interrupt. 


If spurious interrupt vector is shared with some other 
interrupt then the following guidelines should be fol- 
lowed. If the source is spurious interrupt (for which the 
corresponding ISR is not set) then EOI should not be 
issued. If the source is a valid interrupt sharing the 
spurious interrupt vector (for which the corresponding 
ISR is set) then EOI should be issued. 


NMI AND EOI 


For NMI type of interrupt no IRR bit is set. So, obvi- 
ously EOI should not be issued while servicing NMI 
type of interrupts. 


PRELIMINARY 


AP-388 


PROGRAMMING I/O UNIT 


interrupt Sharing Considerations 


Two different interrupts should not be programmed 
with the same interrupt vector. This means that each 
redirection table in a system should have unique vector. 
Interrupt sharing can be done electrically. Interrupts 
connected at different interrupt input pins of 82489DX 
CAN NOT share interrupt by having same vector. 
82489DX does not support active low interrupts. So 
sharing interrupts should have polarity logic support 
externally. 


I/O Unit and Priority 


The 82489DX partitions its interrupt control function 
among two different units: 


1. I/O unit 
2. local unit 


The priority resolving is done at local unit. The I/O 
unit does not involve itself in the priority mechanism. 
The I/O unit takes a snapshot of interrupts pending at 
the INTIN interrupt input pins. If interrupts are active, 
it starts sending the interrupt messages over Icc bus. It 
starts sending the lowest numbered interrupt input 
first. That is if INTINO and INTINS are found active 
in a snapshot, interrupt message corresponding to IN- 
TINO is sent first regardless of the priority of the vec- 
tors that are associated with these interrupts. It sequen- 
tially sends all the interrupts found active in a snapshot. 
Before sending, it checks whether the corresponding 
INTIN is still active. This is the reason why interrupts, 
both edge and level triggered, should be kept active 
until CPU acknowledges it. The difference between 
edge triggered and level triggered interrupt is that edge 
triggered interrupts ensure only one activation of inter- 
rupt per low to high edge whereas the level triggered 
interrupt allows to have multiple interrupts as long as 
the interrupt is held high. It should be noted that both 
edge and level triggered interrupts are active high. 


MP SYSTEM 


Initialization Sequence 


This section assumes the system with multiple CPUs 
with each CPU having its own 82489DX local units 
and local interrupts (like local secondary cache data 
parity interrupt, coprocessor interrupt etc.,) connected 
to the respective local units. The system additionally 
assumes symmetric multiprocessing in the sense that 
I/O system is symmetric and it can be initialized by any 
CPU in the system. 
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Section A: Code Executed by all CPUs in the 
system 


Section B: Synchronization to indicate Section A 
is completed 


Section C: Only one CPU need to execute this 
Code 


Each local unit is visible (through address mapping) 
only to that CPU to which local unit is attached. So 
each local unit will be programmed by its own CPU. 
Thus the code specified as section A will be executed by 
all CPUs in the system. 


Section B of the initialization code is also executed by 
all the CPUs. This section of the code ensures that all 
the CPUs have completed execution of their “Section 
A” so that all Local units are properly initialized with 
different IDs, the system is in a consistent state, etc., 


I/O unit is system wide, only one CPU need to pro- 
gram the I/O unit part of the 82489DX. 


Write the local Unit ID (if needed) 


Write all Ones to Destination Format Register 


Write Logical Destination Register 


Raise the Task Priority 


Program the Spurious Interrupt Vector 
Vector and Enable the Local Unit 


Program the Vectors for Local Interrupts and 
Timer 


Program the Timer Control Registers 
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Clear the mask for Local Interrupts and Timer 


f 


intel. 


It should be noted that the interrupt descriptors, inter- 
rupt service routine, spurious interrupt service routine 
and other interrupt related structure should have been 
initialized before the Section A. This is because 
Section A code enables respective local interrupts and 
timer interrupt vectors and when the interrupts arrive 
from these devices Section A ensures that 82489DX 
will provide the vector. But the code executed before 
Section A should ensure the interrupt structure is ini- 
tialized. Spurious interrupts are bound to occur because 
of the asynchronous interaction between interrupts and 
software writing to task priority register. So spurious 
interrupt service routine has to be initialized before Sec- 


‘tion A. 


WRITE THE LOCAL UNIT ID (IF NEEDED) 


Each local unit can get the ID latched by reset from 
82489DX address pins A3-A10. If the hardware en- 
sures that during reset each local unit in the system gets 
different pattern on the address pins A3—A10 then all 
the local units are initialized automatically with differ- 
ent IDs. In that case writing to the local unit ID by 
software is not mandatory. If software writes the local 
unit ID then it should be read from some address space 
which is same for all CPUs but have different IDs for 
different CPUs. This will ensure that the same code 
when executed by different CPUs will initialize respec- 
tive local unit with different ID. 


WRITE ALL ONES TO DESTINATION FORMAT 
REGISTER 


All the 32 bits of Destination Format Register are writ- 
ten with 1. This is to support single level logical ad- 
dressing mode. This mode is explained in the following 
paragraph. 


WRITE TO LOGICAL DESTINATION REGISTER 


The logical Destination Register should be written with 
the logical destination address. It should be noted that 
since each CPU needs to assign a different logical Des- 
tination address to its own 82489DX local unit and 
since this code is executed by all CPUs the logical Des- 
tination address should be read from some address 
space which is the same for all CPUs but contains dif- 
ferent Destination address values. Since logical Desti- 
nation address is in bit decoding format, typically this 
can be achieved by shifting the CPUID. 
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The logical destination register with Destination format 
register can be used to support flat model. In this mod- 
el, bits 24 through 31 of the destination address of the 
interrupt message vector are interpreted as decoded 
field. Intel strongly recommends for future compatibil- 
ity to use only bits 31 to 24 of logical destination regis- 
ter. To have binary compatibility with future APIC im- 
plementations, any code written for 82489DX should 
not use bits 0 to 23 of logical destination register. This 
field is compared against the logical destination register 
of the local unit. If there is a bit match (i.e., if at least 
one of the corresponding pair of bits of the destination 
field and logical destination register match) this local 
unit is selected for interrupt delivery. Each bit position 
in the destination field corresponds to an individual lo- 
cal unit. For future compatibility, only bits 0 to 23 of 
logical destination register should be zero. This scheme 
allows the specification of arbitrary groups of 
82489DXs simply by setting the member’s bit to one, 
but allows a maximum of 8 local units in the system 
since bits 0 to 23 of the logical destination register is 
zero. Broadcast to all is achieved by setting all 8 bits of 
destination to ones. This selects all 82489DXs in the 
system. 


If more than 8 units are to be addressed in the system 
(and if future compatibility is not a major issue) then all 
the bits of the logical distination register can be used as 
a bit map thereby increasing the number of CPUs ad- 
dressable in logical addressing to 32. 


RAISE THE TASK PRIORITY 


Before enabling the local interrupts and timer inter- 
rupts the task priority is raised to maximum priority in 
the system so that these interrupts are masked tempo- 
rarily. 


PROGRAM THE SPURIOUS INTERRUPT 
VECTOR AND ENABLE THE LOCAL UNIT 


The spurious interrupt vector register is programmed 
with the corresponding vector. This vector will be 
pointing to a dummy routine with just an IRET. The 
unit is enabled so that the tristate pin PINT can come 
out of tristate state to pass the interrupts. 


PROGRAM THE VECTORS FOR LOCAL 
INTERRUPTS AND TIMER 

The interrupt vectors for Local Interrupts and timer 
are initialized with corresponding vector. 


PROGRAM THE TIMER CONTROL REGISTERS 


The timer registers such as divider configuration regis- 
ter, initial count, mode of operation and source of the 
timer clock are programmed. 
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CLEAR THE INTERRUPT MASK FOR TIMER 
AND LOCAL INTERRUPT 


The interrupt mask is cleared for timer and local inter- 
rupts by clearing the interrupt mask bit in their respec- 
tive interrupt vector register 


INITIALIZE THE LOCAL INTERRUPT SOURCE 


The local interrupt sources are also programmed for 
proper system operation. This involves enabling the in- 
terrupt from the sources. The order of enabling the in- 
terrupt is very important. First the 82489DX entries 
should be cleared and then the sources connected to the 
pins should be enabled. If done the other way, inter- 
rupts may get lost. This is true particularly in edge 
triggered interrupt inputs where if 82489DX mask is 
cleared after enabling the source interrupt, 82489DX 
may not have a chance to capture the low to high edge 
which might have produced immediately after the 
source interrupt is enabled and before 82489DX mask 
is cleared. 


BROADCAST ALL INCL. SELF RESET 
DEASSERT MESSAGE 


This is done so that all the local unit’s ArbIDs are in 
sync. It should be noted that for breaking the tie during 
lowest priority arbitration ArbID is used. ArbID is 
copied from local unit ID during reset. Since local unit 
IDs can be written through software and at that time 
ArbID is not updated there may be a case where all 
ArbID in a system to have same value. To avoid such 
situation Reset Deassert message is sent to ALL INCL. 
SELF so that the ArbIDs are different in the system. 


LOWER THE TASK PRIORITY 


Task priority is lowered so that the interrupts can be 
armed to the CPU. | 


Section B 


Synchronization 


There are many methods available for synchronization. 
Test - and - set is a simple primitive, for example, avail- 
able for synchronization. Counting semaphores can be 
built using this test - and - set primitive and synchroni- 
zation can be achieved. 


The main idea is to achieve global synchronization 


among the processors to indicate the local unit portion 
is programmed. 
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Section C 


SYSTEM WIDE RESOURCES PROGRAMMING 


This portion needs to be programmed by one CPU 
only. It should be noted that since the system environ- 
ment we are assuming is shared memory symmetric 
MP system, CPU specific coding is not possible. System 
wide resource programming can be achieved by many 
ways depending on simplicity and performance (since 
this is only initialization routines, performance should 
not matter much) tradeoff. The following sequence il- 
lustrates a simple approach to program. The assump- 
tion here is that 82489DX will get reset both during 
cold reset and warm reset. 


Locked access to the system wide resource, 
1/O unit 


Read a specific MASK from 
Redirection Table Entry 


If the mask is set, Jump to Prog. I/O unit 


If the mask is not set, Release lock and Jump 
to 1/0 unit Done 


Prog. I/O Unit: Write to the index register to 
select unit ID reg 


Write I/O unit ID in the ID register 


Write to index register to select I/O unit 
Version Reg 


Read the Version reg. to know no. of 
RedirTable Entries, N 


RedirTble: Write to index register to address 
MSB Redir. Table n 


Write to MSB Redirection Table Entry n 
7 Destination local unit ID 


Write to index register to address LSB 
Redir. Table n 


Write to LSB Redirection Table n 
mode,dest.,mask, Vector of INT 


| 
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Loop to RedirTble: till all N (here N = 16) 
Entries are done 


Release the lock 


I/O unit done: Remaining system init like |/O 
system etc., 


starting from 0 and I/O unit will be initialized by the 
system to non Zero ID, I/O unit can be read and if 0 


gramming the I/O units by jumping to I/O unit done. 
The I/O unit registers are organized as index register 
and data register. Other portions of section C are self 
explanatory. After I/O unit done, the I/O system ini- 


tialization can be started so that the interrupts can start 
flowing in the system. 


INTERRUPT SERVICE ROUTINE 


ISR (x) 
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The above ISR x shows the interrupt service routine of 
interrupt level x. The stack, hardware context like CPU 
registers and software context like task specific vari- 
ables are saved. The Servicing is done as specific to the 
interrupting source. This may involve reading a status 
register or initiating a thread to read a “full” buffer, 
initiating a thread to write data to some “empty” regis- 
ter, or acknowledging an interrupt from another CPU. 
This is the point at which the interrupting source is 
supposed to deactivate its request. Next EOI is issued 
to reset the ISR bit corresponding to the interrupt level 
x. Till now the interrupts from same level and lower 
levels were masked. Once EOI is written, interrupts 
from all the levels can start coming. The hardware con- 
text and software context are restored followed by stack 
cleaning and a proper Return from Interrupt is execut- 
ed. 


There are couple of timing issues that can be considered 
here. The time delay between Service and EOI is re- 
ferred here. This timing and its relevance to edge/level 
triggered interrupt is discussed as follows: In the case of 
edge triggered interrupts, for each edge one Interrupt 
message is sent by I/O unit to local unit over Icc bus 
whereas for level triggered interrupts there are two in- 
terrupt messages sent, one during assertion of level in- 
terrupt, and another during deassertion of level inter- 
rupt. In edge triggered interrupts, since the deassertion 
of interrupt does not result in any interrupt message, 
there are not many issues with the timing delay be- 
tween Service and EOI, even though in general delay- 
ing EOI means interrupts from the same interrupt 
source are kept pending from interrupting CPU. In lev- 
el triggered interrupts after service the I/O device 
starts deasserting its interrupt request. This results in 
an interrupt message to clear IRR bit in the local unit. 
This may take some time because the minimum possi- 
ble time in Icc bus is 2.3 ws (10 MHz Icc clock as- 
sumed). If the Icc bus is occupied by some other mes- 
sages already then this IRR clearing message has to 
wait to get its turn which means additional delay. If 
EOI is issued before this happens then ISR gets cleared 
and IRR for this “done interrupt” is still alive to erro- 
neously set ISR again. This will result in another inter- 
rupt. So “Early Servicing” is advisable in level triggered 
interrupts. 


DOS Environment 


In the DOS environment the initialization portion is the 
only routine to be coded since the 82489DX acts as a 
virtual wire once initialized and needs no more pro- 
gramming. Since it is uniprocessor environment there is 
no need for synchronization. 


The interrupt from 8259 is programmed as type 
ExtINTA and other redirection table entries are not 
accessed since their masks are set by reset and hence 
disabled. 
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In the Interrupt service routine, since EOI is not need- 
ed for ExtINTA type of interrupts, no programming is 
needed for 82489DX. Since ExtINTA type of inter- 
rupts do not have any relationship to task priority, Spl 
routines do not apply for DOS configurations. 


Transition from 8259 to 82489DX 


Typically, platforms with 82489DX will support 
82489DX in virtual wire mode. The BIOS in the 
EPROM will program the 82489DX in “Virtual Wire”’ 
mode. Typically systems boot DOS and then the 32 bit 
high performance OS is given control. There are also 
situations where after BIOS code is executed the high 
performance OS is given control. In both the situations, 
the 8259 will be operational during the DOS or BIOS 
portion of the code and interrupts will be flowing in the 
system. When the high performance OS is given con- 
trol, it may want to disable the interrupt during initiali- 
zation. This will involve disabling 8259. After disabling 
8259, the 32 bit OS initializes and then it may want to 
enable interrupt mechanism which involves enabling 
82489DX. The sequence we are encountering here is 
8259 (and one input of 82489DX enabled in 
“ExtINTA” mode) enabled, 8259 disabled and then 
82489DX enabled. When 82489DX is enabled in 32 bit 
OS all the interrupt inputs are enabled as opposed to 
the only one interrupt enabled in “Virtual Wire” mode. 
The additional difference is that 82489DX is no more a 
“virtual wire” but it is functioning as an interrupt con- 
troller. 


In the above situation, consider the following scenario. 
The 82489DX is functioning as “virtual wire” and 
passing the 8259 interrupts as ““ExtINTA” mode to the 
local unit. When interrupt mechanism is disabled by 
CLI (Clear interrupt) or masking the 8259 interrupt, 
there may be a possibility that already 8259 originated 
interrupt may be pending at the local unit asserting 
interrupt to the CPU. Now since the CPU has executed 
CLI, the interrupt is not serviced and the interrupt is 
kept pending. It should be noted that the pending inter- 
rupt is of type “ExtINTA”’. After this, 32 bit OS gets 
loaded which configures 82489DX redirection tables 
and interrupt is enabled. Now the “old pending” inter- 
rupt is delivered and since it is ““ExtINTA” the external 
hardware will typically pass the interrupt acknowledge 
cycle to 8259. But at this point of time 8259 has been 
masked by 32 bit OS. Hence the “masked” 8259 re- 
sponds with IR7 vector. So the 32 bit OS should reserve 
IR7 vector for both master and slave 8259 for “‘empty- 
ing” the old pending interrupt since the “Virtual Wire” 
remembers the previous interrupt. 


Sequence of Enabling: In the case of enabling interrupt 
controllers in “ExtINTA” mode the 82489DX should 
be enabled before the 8259 interrupt Controller is en- 
abled. This is because ExtINTA is “‘edge triggered”’ and 
if 8259 is enabled before 82489DX, 8259 might have 
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given an interrupt request by activating its interrupt 
output while 82489DX is still not enabled. When 
82489DX is enabled the interrupt input has a high level 
and 82489DX had no chance of capturing the low to 
high edge of 8259. 


Spurious Interrupt Service Routine 


It is advisable not to share spurious interrupt vector 
with any genuine interrupt source. This section assumes 
that spurious interrupt vector is not shared with any 
other interrupt. 


82489DX does not set ISR in response to spurious in- 
terrupt, NMI type of interrupt, Reset type of interrupt 
and ExtINTA type of interrupts. For all these inter- 
rupts EOI should not be issued. 


Some systems have a variable count in the supervisor 
data structure to count number of spurious interrupts 
raised in the system. This can be used to study the 
reliability and “noise level” of the system. But in 
82489DX architecture, spurious interrupt can occur 
even by a dynamic write to task priority register, fre- 
quency of spurious interrupt does not mean anything 
related to “noise level’. 


Return from interrupt 


Spl(x) Routines 
The processor handles the I/O system through device 


driver interface. The device driver consists of two en- 
tries to access the I/O system: 1) Call entry and 2) 


Typical usage of these routines 


intel. 


Interrupt entry. The interrupt entry is the one that we 
have been discussing for a while, i.e., interrupt service 
routine. The Call entry is the way the I/O system is 
accessed to initiate and service devices. The call entry 
has its own task priority and interrupt entry has the 
priority that is associated with the device interrupt lev- 
el. The call entry and interrupt entry processes have 
I/O data structure like linked list, buffer pointers in 
common which they share. Mutual exclusion is needed 
to ensure the integrity of I/O system. 


To ensure the mutual exclusion between these two pro- 
cesses running in the same processor, Spl(x) routine is 
used. The call entry routine (which is normally at a 
lower priority than the interrupt entry routine) calls 
Spl(x) routine to elevate its own priority above (or 
equal to) that of the corresponding device’s interrupt 
priority. At this priority the interrupts from the device 
are masked out and the shared data structures can be 
accessed (exclusively). 


Once this is done the priority is restored back to origi- 
nal value so that other interrupts won’t suffer for rela- 
tively long time. 


Spl is used to save the current task priority and Spl(x) 
is used to elevate the task priority. 


Spi() 
Read and return the 82489DX 
local unit task priority register 
Spi(x) 


Write x to task priority register to raise priority to x 


//Save the current task priority register value// 
//Raise the task priority value // 


// Access the Shared data structure // 


// Restore the task priority register // 
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PCICLK | | | | | | | | | 


IRDY # 


DEVSEL# [Asserted by PCEB] 


STOP# [Asserted by PCEB] 
(in general Retry indicator) 


FRAME# \ 
IRDY # \ Absence of Retry 
Event 


DEVSEL# (82489DxX logic Drives it after waiting for 
absence of Retry Event, for example, 
stop# sign from PCEB) 


TRDY# (82489DX logic 1 \ } 


Drives it to complete the cycle) 
292116-32 


Case 2) interrupt Acknowledge Cycle 
Source of the Interrupt and Vector is 82489DX 


FRAME# 


IRDY # 


DEVSEL# (PCEB 
After waiting for \ 
"external” DEVSEL ¥) — 


TRDY # (PCEB ati. sees 1 


completes the cycle) 


292116-33 


Case 3) Interrupt Acknowledge Cycle. 
Source of the Interrupt and Vector is ESC 8259 
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PC| BUS 


Control 
Lines Address Latch 


EXTINTA 


ae, 


82489DX 
82489DX 
PCI 
Interface 


PCI Data Buffer 
& 
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82489DX-PCI Interface 


82489DX AND PCI-EISA BRIDGE 
INTEROPERABILITY 


82489DX gives performance benefits in multithreaded 
operating systems. Thus in both uniprocessor and mul- 
tiprocessor systems 82489DX enhances the system level 
performance. This is because of its advantage in task 
priority management. Intel’s PCIset EISA bridge com- 
ponent PCEB (B-stepping onwards) can interoperate 
with 82489DX. In this section, we will go over the 
“hook” provided by PCEB to connect 82489DX in a 
PCI system with some external glue logic. 


From the Interrupt acknowledge timings of PCEB (B- 
stepping onwards) it can be inferred that whenever the 
internal data buffers are empty, on an interrupt ac- 
knowledge cycle, PCEB waits one clock cycle so that 
PCI interface logic for 82489DX can activate 
DEVSEL#. But, if the internal data buffers are not 
empty, then PCEB drives STOP# to retry the INTA 
cycle so that it can flush the buffers. 


If DEVSEL # is seen active and if the PCEB’s internal 
data buffers are empty, then PCEB allows 82489DX to 
own the INTA cycle. Thus, the external 82489DX glue 
logic, on an INTA cycle, should first sample STOP #. 
If STOP # is driven, then the glue logic should ignore 
the cycle. Because PCEB has some data in the buffers it 
wants to flush them before INTA cycle is run. So, the 
82489DX glue logic should not start the cycle to 
82489DX. If STOP# is not active and if the 
“ExtINTA” pin from 82489DX is inactive (to indicate 
that the cycle is for 82489DX) then it should drive 
DEVSEL# immediately to own the INTA cycle. At 
the same time it can start the cycle to 82489DX. It 
should be noted that 82489DX needs two INTA cycles 
whereas PCI bus has only one INTA cycle. So, the 
external logic is responsible for splitting one PCI INTA 
cycle into two 82489DX INTA cycles back to back but 
pass only one READY to the system. 
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During INTA cycle on PCI bus if “ExtINTA” pin is 
active (to indicate that it is 8259 INTA cycle), then the 
82489DX glue logic should not drive DEVSEL#. Thus 
by finding DEVSEL # inactive, the PCEB will respond 
to the INTA cycle. 


Thus with minimal external glue logic, it is possible to 
design an APIC based PCI system. Since PCI local bus 
will improve the I/O performance of the system, APIC 
will enhance the improvement of the overall system 
performance in a multithreaded environment. 


HARDWARE DESIGN 
CONSIDERATIONS 


Design Consideration 0 


Any edge triggered interrupt creating an active edge 
while the interrupt is masked at 82489DX is lost. The 
82489DX samples the edge triggered interrupt input 
only when it is unmasked. If an edge occurs while the 
interrupt is masked, that interrupt is lost. The software 
should always unmask the interrupt at 82489DX and 
then enable at the device. By this, it is made sure that 
82489DX will have a chance to find the active going 
edge. 


Design Consideration 1 


Description: The following design consideration has to 
be taken care of when using ISP (82357) as external 
interrupt controller. 82489DX allows connecting exter- 
nal 8259 type interrupt controller at one of its inputs. 
The mode associated with the interrupt input which has 
8259 connected to’ it is called ExtINTA mode. 
82489DX allows only EDGE TRIGGERED program- 
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ming option for ExtINTA mode. But in the case of 
82357, the INT output from ISP stays high in case 
more than one interrupt is pending at its inputs. It does 
not always inactivate its INT output after INTA cycle. 
This will lead to a situation where ISP keeps the inter- 
rupt at high level continously and waits for INTA cy- 
cle. But since 82489DX expects an edge for interrupt 
sensing (for ExtINTA interrupts) it does not pass the 
interrupt to CPU and further interrupts are lost. So 
External circuitry should monitor the end of SECOND 
CYCLE of INTA cycle and force an inactive state at 
82489DX’s input. This can be done by ANDing ISP’s 
output with a forced brief low going pulse at the end of 
second INTA cycle. This will generate an edge for each 
interrupt at 82489DX’s input. For more refined edgé 
generating logic, refer to data book, Order Number 
290446. 


Design Consideration 2 


Description: The following design consideration has to 
be taken care of when using 82489DX in EISA systems. 
EISA ISP(82357) chip integrates 8259A. It additionally 
allows sharing of interrupts. To facilitate this sharing it 
has a programmable register, ELCR (Edge / Level trig- 
ger control register) by which certain interrupt inputs 
can be programmed as edge (low to high except for 
RTC) or level (the level is active low). The determina- 
tion of edge or level is done during initial configuration 
of EISA system by reading EISA add in boards from 
the interrupt description data structures. The solution 


P 
R 
) 
G 
R 
A 
M 
M 
A 
B 
L 
E 


PRELIMINARY 


AP-388 


is to have programmable logic at the interrupt inputs so 
that 82489DX is compatible with EISA ISP. This will 
introduce one more register and logic to support this. 
This should be an 11 bit programmable register and an 
array of ExOR logic (12 ExOR gates or equivalent 
PLD). The ISP allows programmability of the follow- 
ing interrupts. It is highly recommended to use the 
same address of ELCR (and also bit definitions) for this 
polarity register, if possible, so that when ELCR is 
written this register will also be written. By this there is 
no separate programming needed for this polarity regis- 
ter. This will help to maintain compatibility with future 
APIC implementations which may use the existing 
ELCR register itself for polarity control. This is true 
for integrated APIC. 


INT3 INT4 INT5 INT6 INT7 INT9 INT10 INT11 
INT12 INT14 INT15. In addition to the above 11 in- 
terrupts, it fixes INT8 to be active low edge triggered 
interrupt. INT8 is the only case where it is active low 
edge triggered type. So the following logic can be used 
to add programmability in 82489DX based EISA sys- 
tem. Before connecting these 11 interrupt lines directly 
(#INT8 which is from Real Time Clock is always ac- 
tive low edge triggered. #INT8 can be passed through 
an inverter since there is no need for programmability) 
to the 82489DX they should pass through an array of 
11 Ex__OR gates. One input of Ex__OR gate connects 
to the corresponding INT pin and other input connects 
to a bit of programmable register. The output of 
Ex__OR gate is connected to 82489DX. The idea of 
Ex__OR is to use as a controlled inverter. 


INTIN4 
INTIN6 


INTINS 


INT11 


INTIN 10 INTIN1 1 


INT14 


INTIN 12 INTIN14 


INTIN1S 
#INT8 INT8 
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INTIN are the interrupt inputs to the 82489DX and 
INT are the system interrupt. The Ex__OR gating reg- 
ister is programmed after EISA configuration is found 
from add in boards as how these interrupt lines are 
going to be used in that particular configuration. If a 
particular input is edge triggered, then the correspond- 
ing bit in the register is written with 0. If a particular 
input is level triggered, then the corresponding bit in 
the register is written with 1. 


Design Consideration 3 


Icc bus drive is an open drain bus with drive capacity 
of 4 mA only. Since data is transmitted at each Icc 
clock, the “charging” of Icc bus should be fast enough 
to ensure proper logic level at each clock edge. The Icc 
bus needs pull up resistors since it is open drain bus. 
Since the drive is only 4 mA, the pull up resistor value 
can not be less than 5V/4mA. This being the limit of 
the resistor value, the length and the characteristics of 
the Icc trace forces a capacitance value. Both the resis- 
tor and capacitance brings a RC time constant to the 
Icc bus waveform. So, Electrical consideration has to 
be given to and practice of controlled impedence should 
be exercised for layout of the Icc bus. The length of the 
trace should be kept as minimum as possible. If the 
length of the I¢c bus can’t be kept less, than say 6 inch, 
because of mechanical design of the system, the exter- 
nal line drivers should be added to Icc bus and Ic¢c bus 
should be simulated with the added driver characteris- 
tics. 


74(ABT/F)125 


Buffered bus 
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NOTE: 
R1 can be typically 1K. 
R2 is designed from the simulation results. 


Design Consideration 4 


This is related to ADS#, BGT# and CS# timings. 
For bus cycles not intended for 82489DX, (CS# = 1 
where 82489DX is supposed ‘to sample it), any change 
in CS# line while the ADS # is still active, may errone- 
ously cause a RDY# returned from 82489DX. Ano- 
molous behavior may result if for BGT # ties low cases 


a) BGT# goes away just one clock after ADS# or 


b) ADS# is still active, and CS# changes during this 
period. 
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intel. 
For other cases anomolous behavior results if CS# 
changes when ADS # is still active. The following con- 
siderations are important from timing point of view. 
Always limit the pulse width of 82489DX ADS# to 
one CLKIN. Also avoid changing levels on BGT #/ 
CS # line, when ADS # is active for cases being identi- 
fied as BGT# tied low (BGT# sampled low- when 
ADS# goes active). Also avoid changing levels on 
CS# line when BGT # is active. 


REGISTER PROGRAMMING DETAILS 


1/O Window Register — 
(Addr[9:4] = 01 hex) 
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Data access to the register selected by I/O register se- 
lect Register. 


1/O Unit ID Register 


2.2 
43 


000000000000000000000000 


Must Be Zero 
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1/O Unit ID 


For example, for a Unit ID of 0A hex, the I/O unit ID 
register should be written with 0A00 0000. 


1/O Register Select Register 


00 


(Addr[9:4] = 00 hex) 37 


Must Be Zero 


1/O Register Select 
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PRELIMINARY 


1/O Register ; 
Register Selected 
I/O Unit ID Register 
I/O Unit Version Register 


Redirection Table[0] (31:0) 


Redirection Table[0] (63:32) 
Redirection Table[1] (31:0) 
Redirection Table[1] (63:32) | 
Redirection Table[2] (31:0) 
Redirection Table[2] (63:32) 


Redirection Table[7] (31:0) 
Redirection Table[7] (63:32) 
Redirection Table[8] (31:0) 
Redirection Table[8] (63:32) 


Redirection Table[15] (31:0) 
Redirection Table[15] (63:32) 


1/O Unit Version Register 
3 22 11 00 ) 


| ix 65 87 0 
beveeeee[eone rseonnee] 


Max. Redir Entry Version: 


indicates version no. 


Must Be Zero Must Be Zero 
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I/O Unit Version Register is read only register. It reads 
as 00O0F 0OXX where XX is version. 


Max.Redir Entry: This is equal to the number of inter- 
rupt input pins minus 1 of this I/O unit. Read as 15 in 
82489DX. 


Version: The version number that identifies this ver- 
sion. 


Redirection Table[x] (63:32) 
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Destination: If the destination mode of this entry is 
“Physical Mode’, then the 8 MSB (bits 56 through 63) 
contain an 82489DX local unit ID. 


If logical mode, then all the 32 bits (bits 63 through 32) 
of the Destination field potentially defines a set of proc- 
essors. 


| NOTE: 
The same format holds good for Redirection Tables 0 
to 15 (x = 0 to 15) for bits 63 to 32. 


If the destination is to a local unit with ID, say, 05 in 
physical mode, then the redirection table [63:32] 
should be programmed as hex 0500 0000. 


Redirection Table [63:32] should be programmed first 
before programming Redirection Table [31:0]. 


Redirection Table[x] [31:0] 


3 1 1 1 1 
1 7 


1 1 1 1 00 
7 


6 5 a 3 2 1 


0 
0 8 0 
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Bits [31:17]: Reserved. Should be written 0. 


Bit 16: MASK | 
0 — Not masked 
1 — Masked 


Bit [15]: TRIGGER MODE 
0 — Edge Triggered 
1 — Level Triggered 


Bit 14: Remote IRR Status (Read only) 
0 — Remote IRR is clear 
1 — Remote IRR is set 


Bit [13]: Reserved. Should be written 0. 


Bit 12: Delivery Status (Read only) 
0 — Idle 
1 — Send Pending 


Bit [11]: Destination Mode 
0 — Physical 
1 — Logical 


Bits [10:8] Delivery Mode 
000: Fixed 
001: Lowest Priority 
100: NMI 
101: Reset 
111: ExtINTA 


Bits [7:0] Vector 
Vector for this interrupt 


Local Unit ID Register: (Addr (9:4) = hex 02) 


3 Z@2 0 
1 43 0 


zee 000000000000000000000000 


Must Be Zero 
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Local Unit ID 


For example, for a local Unit ID of 0A hex, the local 
unit ID register should be written with 0A00 0000. 


Local Unit Version Register 
(Addr (9:4) = hex 03) 


00 
S 7 


Reserved Version 
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Local Unit Version Register is read only register. It 


reads as 0000 OOYY where YY is version number. 


Bits [7:0] Version: The version number that identifies 
this version. 


Task Priority Register (Addr (9:4) = hex 08) 


3 00 0 
1 8 7 0 


000000000000000000000000 Fad 


Must Be Zero Task Priority 
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Bits [0:7] Task Priority: Should be written with task 
priority. 


End Of Interrupt (EOI) Register 
(Addr (9:4) = hex 0B) 
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Bits [31:0]: Data written to EOI is don’t care. 


Before returning from the interrupt handler, software 
must issue an End-Of-Interrupt (EOI) command to the 
82489DX local unit. For NMI and ExtINTA and Spu- 
rious interrupts EOI SHOULD NOT be issued. 


Remote Read Register: (Addr (9:4) = hex 0C) 


3 0 
1 0 


Data Read from Remote local unit 
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The data read from remote local unit is latched in Re- 
mote Read Register. The software should qualify this 
data with “Remote Read Status bit” in the ICR regis- 
ter. 


interrupt Status Register [ISR]: 

Register Address[9:4] 
ISR[31:0] - hex 10 
ISR[63:32] hex 11 
ISR[95:64] hex 12 
ISR[127:96] hex 13 
ISR[159:128] hex 14 
ISR[191:160] hex 15 
ISR[223:192] hex 16 
ISR[255:224] hex 17 


0 
0 


SRA NaN E 
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Interrupt Status Register is read only. It marks the in- 


terrupts that have been delivered to the processor and 
waiting for EOI. 


Spurious Vector Register: (Addr (9:4) = hex OF) 


3 0 8.9 0 
1 o.8 7 0 


coe] | 


000000000 


Must be Zero 
Unit Enable 


Spurious interrupt Vector 
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Bits [31:09]: Reserved bits. Must be zero. 


Bit 8; Unit Enable: When this bit is 0, the local unit is 
disabled with regard to transmit and responding mes- 
sages on Icc bus. It only responds to messages with 
delivery mode set to “Reset”. Reading a 0 at this bit 
indicates that the unit is disabled. When a 1 is written 
to the bit, the local unit is enabled for both transmitting 
and receiving messages. Once enabled, it should not be 
disabled by software. Only further resets can take the 
unit into disabled condition. 


Bits [7:0] Spurious Interrupt Vector: For future com- 
patibility, the bits [3:0] should be written with 1111. A 
spurious interrupt service routine should be existing in 
the address corresponding to the spurious interrupt 
vector. 


Destination Format Register 
(Addr (9:4) = hex 0E) 


292116-17 


The destination format register enables logical address- 
ing by specifying the bit map in logical destination reg- 
ister. For future compatibility, all the 32 bits of Desti- 
nation Format Register should be 1. 


Logical Destination Register (LDR) 
(Addr (9:4) = hex OD) 
22 


0 
43 0 


Ot A 
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Each local unit can be addressed either physically using 
physical ID or logically using logical destination regis- 
ter. In physical addressing, either only one local unit 
can be addressed at a time or broadcast to all local units 
can be done. In logical addressing, a group of local 
units can be addressed through bit mapping in destina- 
tion addressing and the logical destination register. 


For future compatibility, bits [0:23] of the logical desti- 
nation register and bits [0:23] of the destination ad- 
dress in the message should be zero. Bits 24 through 31 
of destination information in the interrupt message re- 
ceived are interpreted as decoded field. This field is 
compared against the logical destination register of the 
local unit. If there is a bit match (i.e., at least one of the 
corresponding pair of bits of the destination field and 
LDR match) that unit is selected for interrupt delivery. 
Each bit position in the destination field corresponds to 
an individual Local unit. This scheme allows the speci- 
fication of arbitrary groups of local units by setting the 
member’s bits to 1, but allows a maximum of 8 local 
units in a system since only bits 24 through 31 (of the 
Logical Destination Register and logical destination ad- 
dress in interrupt message) are used. Broadcast to all is 
achieved by setting all 8 bits of destination to ones. This 
selects all local units in the system. 


In a very large multiprocessor system where future 
compatibility is not a main problem, all the 32 bits of 
the Logical Destination Register and all the 32 bits of 
the destination address in the interrupt message can be 
used as a bit map to address the processors. When mes- 
sage addresses the destination using logical addressing 
scheme, the local unit compares the logical address in 
the interrupt message with its own logical Destination 
Register. Thus it is possible to support 32 processors in 
logical addressing mode. 


Trigger Mode Register [TMR]: 

Register Address[9:4] 
TMRI[31:0] hex 18 
TMR[63:32] hex 19 
TMR[95:64] _ hex 1A 
TMR[127:96] hex 1B 
TMR[159:128] hex 1C 
TMR[191:160] hex 1D 
TMR[223:192] hex 1E 
TMR[255:224] hex 1F 


0 
0 


Ls dacidalatasiecbeentigmanabaal 
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If a bit corresponding to an interrpt vector number is 0, 
then it is assumed as edge triggered interrupt. For edge 
triggered interrupt, the corresponding IRR bit is auto- 
matically cleared when interrupt service starts. If 1 
(level triggered) this is not the case. Instead, the source 
82489DX (source I/O unit or Source Local Unit) must 
explicitly request the IRR bit be cleared (upon deassert 
of the interrupt input pin or upon sending an appropri- 
ate interprocessor interrupt). Upon acceptance of inter- 
rupt, the TMR bit is cleared for edge triggered inter- 
rupts and set for level triggered interrupts. This infor- 
mation was carried in the accepted interrupt message. 
The source 82489DX I/O unit also tracks the state of 
the destination unit’s IRR bit (Remote IRR bit in the 
redirection table). When a level triggered interrupt in- 
put is deasserted, the source 82489DX I/O unit detects 
the discrepancy between the input pin state and the 
Remote IRR, and automatically sends a message telling 
destination 82489DX to clear IRR for the interrupt. 


Interrupt Request Register [IRR]: 

Register 
IRR[31:0] 
IRR[63:32] 
IRR[95:64] 
IRR[127:96] 


Address [9:4] 
hex 20 
hex 21 
hex 22 
hex 23 
hex 24 
hex 25 
hex 26 
hex 27 


IRR[159:128] 
IRR[191:160] 
IRR[223:192] 
IRR[255:224] 


0 
0 


Ria A We 
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It contains the active interrupt requests that have been 
accepted, but not yet dispensed by this 82489DX local 
unit. A bit in IRR is set when 82489DX local unit 
accepts the interrupt. When TMR is 0, it is cleared 
when the interrupt is serviced; when TMR is 1, it is 
cleared when the 82489DX local unit receives a mes- 
sage to clear it. 


interrupt Command Register [31:0] (Addr [9:4] = 30 hex) 


ie uit 259 1 
09 s7.s,.3. ,4 


00000 
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1 1 1 00 0 


2 ae Ss 7 0 
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Bits [31:20]: Reserved. Should be written 0. 


Bits [19:18]: Destination Shorthand. This field indi- 
cates whether a shorthand notation is used to specify 
the destination of the interrupt and if so, which short- 
hand is used. Destination shorthands do not use the 32- 
bit Destination field, and can be sent by software with a 
single 32-bit write to the 82489DX’s interrupt com- 
mand register. Shorthands are defined for the following 
cases: Software self interrupt, interrupt to all processors 
in the system including the sender, interrupts to all 
processors in the system excluding the sender. 


00: (dest field) means that no shorthand is used. The 
destination is specified in the 32-bit Destination 
field in the second word (bits 32 to 63) of the inter- 
rupt control register. 


01: (self) means that the current local unit is the single 
destination of the interrupt. This is useful for soft- 
ware interrupts. The destination field in the inter- 
rupt command register is ignored. RESET assert 
Delivery mode should not be used with self desti- 
nation. Only FIXED delivery mode should be used 
with SELF. 


10: (all incl. self) means that the interrupt is to be sent 
to “all’’ processors in the system including the 
processor sending the interrupt. The 82489DX will 
broadcast a message with destination unit ID field 
set to all ones. RESET assert Delivery mode 
should not be used with “all incl. self’? destination. 


11: (all excl. self) means that the interrupt is to be sent 
to all processors in the system excluding the proc- 
essor sending the interrupt. The 82489DX will 
broadcast a message with destination unit ID field 
set to all ones. 


Bits [17:16]: Remote Read Status. This field indicates 
the status of the data contained in the Remote Read 
register. This field is read only to software. Whenever 
software writes to the interrupt command register using 
Delivery mode “Remote Read’ the Remote Read 
Status becomes “in progress’’ (waiting for the remote 
data to arrive). The remote 82489DX local unit is ex- 
pected to respond in a fixed amount of time. If the 
remote 82489DX local unit is unable to do so, then the 
remote read status becomes “‘invalid”’. If successful, the 
Remote Read status resolves to “Valid”. Software 
should poll this field to determine completion and suc- 
cess of the Remote Read command. 


00: (invalid): The content of the Remote Read register 
is invalid. This is the case when after a Remote 
Read command is issued and the remote 82489DX 
Local unit was unable to deliver the Register con- 
tent in time. 


01: (in progress): a remote read command has been is- 
sued and this 82489DX is waiting for the data to 
arrive from remote 82489DX local unit 
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10: (valid): the most recent Remote Read command 
has completed and the remote read register content 
is valid. 


11: reserved. 


Bit [15]: TRIGGER MODE 
0 — Edge Triggered 
1 — Level Triggered 


Software should use this bit in conjunction with Level 
Assert/Deassert to generate interrupts that behave as 
edges or levels. For future compatibility, send ICR 
messages only in edge triggered mode. 


Bit [14]: LEVEL. Software should use this bit in con- 
junction with the Trigger mode bit when issuing an 
inter-processor interrupt to simulate assertion/deasser- 
tion of level sensitive interrupts. 


Toassert: Trigger mode = 1 and Level = 1. 


To deassert: Trigger mode = 1 and Level = 0. 


For example, a message with Delivery mode of “Re- 
set’’, a trigger mode of “Level”, and Level bit of 0 deas- 
serts reset to the processor of the addressed 82489DX 
Local unit(s). As a side effect, this will also cause all 
82489DX to reset their Arbitration ID to their unit ID. 
(The Arb ID is used for tie breaking in lowest priority 
arbitration.) For future compatibility, only edge trig- 
gering should be used in ICR. 


Bit [13]: Reserved. Should be written 0. 


Bit [12]: Delivery Status (Read only) 
0 — Idle 
1 — Send pending 


Delivery status is software read-only. Software can read 
to find out if the current interrupt has been sent, and 
the Interrupt command register is available to send the 
next interrupt. If the interrupt command register is ov- 
erwritten before the Delivery status is “‘idle’’, then the 
destiny of that interrupt is undefined; the interrupt may 
have been lost. 


Bit [11]: Destination Mode 

0 — Physical 

1 — Logical 
In physical mode, a destination 82489DX is identified 
by its Local Unit ID. Bits 56 through 63 (8 MSB of the 


destination field) specify the 8-bit 82489DX Local unit 
ID. 
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In logical mode, destinations are identified by matching 
on Logical Destination under the control of the Desti- 
nation Format Register in each Local 82489DX. The 
32-bit Destination field is the logical destination. For 
future compatibility, use only bits [31:24] of the logical 
destination address. Bits [23:0] should be zero. 


Bits [10:8]: Delivery Mode _ 


000: (Fixed) means deliver the signal on the INT pin of 
all processors listed in the destination. Trigger 
mode for “fixed’’ Delivery Mode can be edge or 
level. 


001: (Lowest Priority) means deliver the signal on the 
INT pin of the processor that is executing at the 
lowest priority among all the processors listed in 
the specified destination; Trigger mode for “low- 
est priority” Delivery mode can be edge or level. 


011: (Remote Read) is a request to a remote 82489DX 
local unit to send the value of one of its registers 
over the Icc bus. The register is selected by pro- 
viding its address in the vector field. The register 
value is latched by the requesting 82489DX and 
stored in the Remote Register where it can be 
read by the local processor. A Delivery Mode of 
“Remote Read” requires an “Edge” Triggered 
mode. 


100: (NMI) means deliver the signal on the NMI pin 
of all processors listed in the destination. Vector 
information is ignored. A delivery mode equal to 
“NMI” requires a “LEVEL” Trigger mode. 


101: (Reset) means deliver the signal to all processors 
listed in the destination by asserting/deasserting 
the 82489DX local unit’s PRST output pin. All 
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addressed 82489DX local units will assume their 
reset state but preserve their ID. One side effect of 
a message with Delivery mode equal to “Reset” 
that results in a deassert of reset is that all Local 
Units (whether listed in the destination or not) 
will reset their lowest-priority tie breaker arbitra- 
tion ID to their Local unit ID. A delivery mode of 
“Reset” requires a “level” Trigger mode. “Reset” 
should not be used with “Self or “all incl.Self”’ 

- Shorthand mode since it will leave the system in 
non-recoverable reset state. If “RESET” is used 
with “all exc.Self’ mode, software should make 
sure that only one CPU executes this instruction 
in an MP system. 


Delivery mode options 010,110,111 are Intel reserved. 
They should not be used. 


Bits [7:0] Vector. The vector identifies the interrupt 
being sent. If the Delivery mode is “Remote Read”’, 
then the Vector field contains the address of the register 
to be read in the remote 82489DX’s Local unit. 


NOTE: 
In cases where Destination field in Interrupt Com- 
mand Register [63:32] is used, Interrupt Command 
Register [31:0] should be programmed only AFTER 
programming Interrupt Command Register [63:32], 
since writing to [31:0] will start sending the message. 


The following are the control words for interrupt com- 
mand register [31:0] for different modes. The interrupt 
vector, for example, is illustrated with AA hex. In the 
remote Read request command RR in the vector field 
specify address of the register to be read. The XX in the 
vector field means the vector is don’t care. 


PRELIMINARY : 


CONTROL WORD 


Fixed INT, Edge triggered int, dest. field specified 

Lowest priority INT, Edge trigg. int, dest. field specified 
Remote Read (only Edge Triggered), dest. field specified 
NMI (Only Level) Level ASSERT, dest. field specified 

NMI (Only Level) Level DEASSERT, dest field specified 
Reset (Only Level) Level ASSERT, dest. field specified 
Reset (Only Level) Level DEASSERT, dest. field specified 


Fixed INT, Edge triggered int, Se/f 

Fixed INT, Edge trigg. int, A// inclusive Self 

Lowest priority INT, Edge trigg. int, A// inclusive Self 
NMI, Level ASSERT, A// inclusive Self 

NMI, Level DEASSERT, A// inclusive Self 

Reset, Level DEASSERT, A// inclusive Self 


Fixed INT, Edge trigg. int, A// exclusive self 

Lowest priority.INT, Edge trigg. int, A// exclusive self 
NMI, Level ASSERT, A// exclusive Self 

NMI, Level DEASSERT, A// exclusive Self 

Reset, Level ASSERT, A// exclusive Self. 

Reset, Level DEASSERT, A// exclusive Self 


Interrupt Command Register [63:32] 


PHYSICAL 


Destination Mode 


0000 O0AA hex 
0000 01AA hex 
0000 O3RR hex 
0000 C4XX hex 
0000 84XX hex 
0000 C5XX hex 
0000 85XX hex 


0004 O0OAA hex 
0008 OOAA hex 
0008 01AA hex 
0008 C4XX hex 
0008 84XX hex 
0008 85XX hex 


000C OOAA hex 
000C 01AA hex 
000C C4XX hex 
O000C 84XX hex 
000C C5XX hex 
O000C 85XX hex 


Bits [63:32] Destination 
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LOGICAL 


Destination Mode 


0000 O8AA hex 
0000 O9AA hex 


NOT SUPPORTED 
0000 CCXX hex 


0000 8CXX hex 
0000 CDXX hex 
0000 8DXX hex 


0004 08AA hex 
0008 O8AA hex 
0008 O9AA hex 
0008 CCXX hex 
0008 8CXX hex 
0008 8DXX hex 


000C O8AA hex 
OOO0C O9AA hex 
000C CCXX hex 
000C 8CXX hex 
000C CDXX hex 
O00C 8DXX hex 


(Addr [9:4] = 31 hex) This field is only used when the Destination Shorthand 
field is set to “Destination Field”. If Destination field is 
physical mode, then the 8 MSB contain an Destination 
Unit ID. If logical mode, the full 32-bit Destination 
field contains the logical address. This register should 
be programmed for proper destination before program- 
ming Interrupt Command Register [31:0]. If the desti- 
nation to a local unit with ID, say, 05 in physical mode, 
then the Interrupt Command Register [63:32] should 
be programmed as hex 0500 0000. 


Bits [63:32] 
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3 a Fees. ama 1 
1 09 


OO0O0...... 00000 


Timer Base 


Timer Mode 


Bits [31:20] Reserved. Should be written Zero. 


Bits [19:18] Timer Base: This field selects the time base 
input to be used by timer. 


00: (Base 0): Uses “CLKIN” as input. 
01: (Base 1): Uses ‘“TMBASE”’. 
10: (Base 2): Uses the output of the divider (Base 2). 


Bit 17: Timer Mode: This field indicates the operation 
mode of timer. 

0 — ONE- SHOT; 

1 — PERIODIC 


In ONE-SHOT, the current count register remains at 
Zero after the timer reaches zero and software needs to 
reassign the timer’s initial count register to rearm the 
timer. 


In PERIODIC mode, when the timer reaches zero, the 
Current Count Register is automatically reloaded with 
the value in the initial Count Register, and the timer 
counts down again. 


Bit 16: Mask: This bit serves to mask timer interrupt 
generation. 


0 — Not masked; 
1 — Masked. 
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Local Timer Vector Table (Addr [9:4] = 32 hex) 
t 1 1 00 


o:- 7° § 5 > @ 1 a7 0 
a) Ge Bs 


Delivery Status 


CLKIN Input TMBASE Input Divider Input 
PAR ere (Base 0) (Base 1) (Base 2) 
PERIODIC timer, MASK cleared 0002 O0AA hex 0006 OOAA hex OO0A OOAA hex 
PERIODIC timer, MASK set 0003 OOAA hex 0007 OOAA hex 000B OOAA hex 
ONE SHOT timer, MASK cleared 0000 OOAA hex 0004 OOAA hex 0008 OOAA hex 
ONE SHOT timer, MASK set 0001 OOAA hex 0005 OOAA hex 0009 OOAA hex 


oO 


Timer Interrupt Vector 
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Bits [15:13] Reserved. Should be written Zero. 


Bit [12] Delivery Status: Delivery status indicates the 
current status of the delivery status of this interrupt. 


0 — IDLE means that there is currently no activity 
for this interrupt. | 


1 — SEND PENDING indicates that the interrupt 
has been injected, but its delivery is temporarily 
held up by other recently injected interrupts 
that are in the process of being delivered; Deliv- 
ery status is software read only. 


Bits [11:8] Reserved. Should be written Zero. 


Bits [7:0] Timer Interrupt vector: This is the 8-bit in- 
terrupt vector to be used when timer generates an inter- 
rupt. 


NOTE: 
TIMER interrupts are always treated as EDGE trig- 
gered interrupts. 


The following is the control word for various modes to 
be used in Local Timer Vector Table. For illustration 
purpose, the interrupt vector for Timer is shown as AA 
hex. 
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Local Interrupt Vector Table Registers 0,1 
1 14 00 
87 


6 5S 4 3 2 1 


1 17 


Trigger mode 
sia Remote IRR 


Register Address [9:4] 
Local IntO Vector table register 35 hex 
Local Int1 Vector table register 36 hex 


The same format applies to both Local IntO and Local 
Int1 registers. 


Bits [31:17]: Reserved: Must be Zero. 


Bit 16: MASK: 
0 — enables interrupt by clearing mask 
1 — masks the interrupt. 


Bit 15: Trigger mode: 
0 — Edge Triggered 
1 — Level Triggered 


Bit 14: Remote IRR: This bit is used for level triggered 
local interrupts. Its meaning is undefined for edge trig- 
gered interrupts. Remote IRR mirrors the interrupt’s 
IRR bit of this local unit. Remote IRR is software read 
only. 


Bit 13: Reserved. Must be Zero. 


Bit 12: Delivery Status: Software read only. Indicates 
the current status of the delivery of this interrupt. 


0 — IDLE means that there is currently no activity 
for this interrupt. 


1 — Send Pending indicates that the interrupt has 
been injected, but its delivery is: temporarily 
held up by the recently injected interrupts that 
are in the process of being delivered. 


Bit 11: Reserved. Must be Zero. 


Bits [10:8]: Delivery mode 
000 — Fixed INT 
100 — NMI 
111 — ExtINTA 
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0 0 


Delivery Status 


Delivery mode Interrupt Vector 
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All other options of Bits [10:8] are reserved. Should not 
be used. 


Bits [7:0]: Vector: This is the interrupt vector to use 
when generating interrupt for this entry. 


The following are the control words for local interrupt 
[0 as well as 1] vector tables for different modes. The 
interrupt vector, for example, is illustrated with AA 
hex. The XX in the vector field means the vector is 
don’t care. 


Interrupt Option Control Word 
Fixed INT, Edge triggered 0000 OOAA hex 
Fixed INT, Level trigg. int 0000 80AA hex 
NMI (Only Level) 0000 84XX hex 
ExtINTA (Only Edge) 0000 07XX hex 


Initial Count Register (Addr [9:4] = 38 hex) 


Bits [31:0] 
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Bits [31:0] Initial Count: Software writes to this regis- 
ter to set the initial count for timer. This register can be 
written at any time. When written, the value is copied 
to the current count Register and countdown starts or 
continues from there. The initial count register is read- 
write by software. 


Current Count Register (Addr [9:4] = 39 hex) 
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Bits [31:0] Current Count: This is the current count of 
timer. It is read only by software and can be read any 
time. 
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Divider Configuration Register 
(Addr [9:4] = 3E hex) 


0. 0 0 0 
a2 1 0 


Divide by 
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Configuration Control Word 


Divide CLKIN by 2 0000 0000 hex 
Divide CLKIN by 4 0000 0001 hex 
Divide CLKIN by 8 0000 0002 hex 
Divide CLKIN by 16 0000 0003 hex 
Divide TMBASE by 2 0000 0004 hex 
Divide TMBASE by 4 0000 0005 hex 
Divide TMBASE by 8 0000 0006 hex 
Divide TMBASE by 16 0000 0007 hex 


000000000000 


Divider Input 


Bits [31:3] Reserved. Must be Zero. 


Bit [2]: Divider Input: Selects whether divider’s input 
connects to the 82489DX local unit’s CLKIN pin or 
TMBASE. 


0 — means the divider takes its input signal from 
CLKIN. 


1 — means use TMBASE 


Bits [1:0]: Divide by: Selects by how much the divider 
divides. 


00 — divide by 2 
01 — divide by 4 
10 — divide by 8 
11 — divide by 16 


Programming Guidelines 
A) Modes of Interrupt in 82489DxX: 
Delivery Mode 


Trigger Fixed Lowest 
Mode ixee' _| Priority Reset|ExtINT 
Destination : 
Delivery 


re Pe ee oe Oe Fe 
EC a ed ea ee 


NOTE: 
@ RESET delivery mode should not be used for Local 
Interrupts. 


@ EOI should not be issued for NMI and ExtINT de- 
livery mode. 
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82489DxX Priority Space 
82489DX priority 


Priority applies among 


Does not apply to 


* Fixed Interrupts * “Reset” Interrupts 


* Lowest Priority Interrupts * "NMI" Interrupts 


* "Processor Task priority” * "ExtINT” Interrupts 


* Spurious Interrupts 
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interrupt Command Register State Diagram 


Reset 


ICR 

message 
sent to 

ICC 

Bus transmit 
unit 
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NOTE: 
*Software should check busy flag of ICR before writing 
interprocessor message. 


CONCLUSION 


82489DX has simple and powerful programming mod- 
el. It has programmable priority and it supports task 
priority in the light of interrupt priority. It reduces the 
SPLQ overhead which is very useful in uniprocessor 
system. The system performance is improved by using 
interrupt priority model to prioritize interrupts and by 
using task priority register for SPLO calls. It provides 
an easy migration path from 8259 by providing 
ExtINTA mode for DOS compatibility. 
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1.0 INTRODUCTION 


This application note discusses memory subsystem de- 
sign for the Pentium™ microprocessor. It first illus- 
trates trade-offs in memory subsystem design for a Pen- 
tium processor-based system. It then provides a design 
example of a high performance DRAM subsystem to 
highlight some of the features of the Pentium processor. 
Also covered in the example design is the bus conver- 
sion logic to interface the Pentium processor memory 
subsystem with a half speed, 32-bit, Intel486™ CPU- 
based bus to allow interface to an existing chip set. The 
example is intended to provide guidelines towards 
memory subsystem design for the Pentium processor. It 
is not necessarily the optimal cost/performance solu- 
tion. 


The Pentium processor bus characteristics presented in 
Section 2.0 and the data provided in the memory sub- 
system performance section is derived from an Intel 
internal performance simulator. Based on previous ex- 
perience with the Intel486 CPU, the simulator has 
proven to be highly accurate (<5% margin of error 
compared to the actual hardware configuration it simu- 
lates) in simulating the CPU, cache and memory per- 
formance. The performance simulation methodology is 
provided in Appendix A. Also provided in Appendix A 
is the contact information for obtaining a simulator and 
Pentium processor models to reproduce or run varia- 
tions of the data presented in this report. 


BS 

intel : 
The discussion in subsequent sections of this applica- 
tion note assumes familiarity with CPU, cache, and 
memory system architectures. For further reference in- 


formation on Pentium processor and other related doc- 
umentation, refer to the following: 


1. Intel486™ Microprocessor-Based 82350 EISA De- 
sign Guide, Order No. 296504-001 


2. 82350 EISA Chip Set Data Book, Order No. 290220- 
003 


3. Pentium™ Processor Data Book, Order No. 241428- 
001 


2.0 Pentium™ PROCESSOR BUS 
CHARACTERISTICS 


The internal cache structure of the CPU significantly 
affects the amount of external bus traffic as it filters the 
different reads, writes and prefetches before they reach 
the memory bus. The amount of bus traffic and the bus 
cycle mix is important to consider when designing an 
efficient memory subsystem. Table 1 details the Penti- 
um processor bus characteristics for different applica- 
tions. 


Table 1. Pentium™ Processor Bus Characteristics 


Memory Bus 
Utilization() 


NOTE: 
1. For Zero Wait State Memory System 
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For the bus statistics presented here and for the per- 
formance analysis in the later sections, five different 
applications were used from DOS, UNIX, and Win- 
dows environments. For simulation purposes, traces 
from each of the five different applications were fed 
through an Intel Internal (CPU-Cache-Memory) Per- 
formance Simulator to extract the desired information. 
The application traces are hardware/software generat- 
ed and consist of random samples of instructions taken 
while the actual programs were running (see Appendix 
A). These applications are described as following: 


DOS 


AutoCAD—Auto Desk’s AutoCAD program comput- 
ing and displaying a drawing. 


UNIX 


Specl—A mixture of integer SPEC89 benchmark suite 
programs running concurrently. 


Windows 


Excel (Spreadsheet)—Microsoft Excel for Windows 
running a spreadsheet calculation. 
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Word (Word Processing)—Microsoft Word for Win- 
dows converting a document for import. 


Clip (Graphics)—Clipping of graphics and text to dif- 
ferent screen regions under Windows. 


2.1 Bus Cycle Mix 


The Bus Cycle Mix for the Pentium processor is illus- 
trated in Figure 1. The result represents an average of 
the bus cycle mix for the five applications shown in 
Table 1. 


On the Pentium processor, the external bus features a 
write-back cache protocol to support its internal data 
cache. This write-back cache architecture significantly 
reduces the amont of write cycle traffic on the memory 
bus compared to a write-through cache architecture, as 
on the Intel486 CPU. However, the number of write 
cycles coming out on the bus and the ratio of writes to 
reads is still dependent on the particular application. 
The average data write-backs, shown in Figure 1, is the 
number of write cycles that occur as a result of a read 
miss in the L1 cache. 


Avg. Data Reads 


Avg. Code Prefetch 


Avg. Data Writes 


Avg. Data Write-backs 


36.00% 
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Figure 1. Bus Cycle Mix for the Pentium™ Processor 
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2.2 Bus Utilization 


Table 1 showed the L1 hit rates for different applica- 
tions. The higher the hit rate in the L1 cache, the less 
amount of time the CPU has to spend on the external 
bus. Bus Utilization refers to the amount of time, out of 
the total time taken to execute the application, that the 
CPU has to spend executing cycles on the external bus. 
Bus utilization is not only dependent on the application 
but also on the memory subsystem. Faster memory sys- 


S | 

intel ° 
tems allow the bus cycles to complete faster and there- 
fore reduce the CPU bus utilization. Table 2 shows the 


Pentium processor bus utilization for three different 
memory subsystems. 


As seen from Table 2, the memory bus utilization in- 
creases as the memory gets slower; also since the DOS 
application (AutoCAD) has a higher hit rate in the L1 
cache compared to the other applications, its bus utili- 
zation numbers are also lower than the rest. 


Table 2. Pentium™ Processor Memory Bus Utilization for Three Different Memory Subsystems 


J |S (Autocad) | UNIX(Spect) | __ Windows (Excel 


Zero Wait State 
RPH = 2-1-1-1 

RPM = 2-1-1-1 
WPH = 2, WPM = 2 
Write Burst = 1-1-1 


Fast System 

RPH = 5-2-2-2 
RPM = 11-2-2-2 
WPH = 3, WPM = 7 
Write Burst = 2-2-2 


Slow System 

RPH = 9-4-4-4 

RPM = 17-4-4-4 
WPH = 6, WPM = 14 
Write Burst = 4-4-4 


NOTE: 


RPH = Read Page Hit, RPM = Read Page Miss, WPH = Write Page Hit, WPM = Write Page Miss. Timing is given in CPU 


clocks at 66 MHz. 
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3.0 MEMORY SUBSYSTEM 
PERFORMANCE 


The performance of the memory subsystem has a direct 
relation to the overall performance of the system. This 
chapter will focus on the impact the memory subsystem 
has on the overall performance of the Pentium proces- 
sor. First, the simulated performance of the DRAM 
subsystem for the Pentium processor covered in the de- 
sign example portion of this application note will be 
presented. This will be followed by examining the im- 
pact of varying different parameters in the design exam- 
ple and how they affect the overall performance of the 
Pentium processor. To examine these effects, the traces 
captured from the applications described in Section 2.0 
were fed through the performance simulator and con- 
figurations varied to measure the performance under 
different conditions. 
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3.1 Overall Performance 


Figure 2 shows the simulated performance of the Penti- 
um processor using the memory subsystem presented in 
the design example. The performance is characterized 
for five different applications and compared relative to 
an ideal zero wait state memory system. The perform- 
ance level of the ideal memory system (i.e. with an infi- 
nite L1 cache) is the highest achievable and therefore 
normalized to 100%. Table 3 provides the parameters 
that were used in simulating the performance of this 
system. 


The simulated performance of this memory subsystem 
ranges from 80% to 94% relative to an ideal memory 
system. The relative performance is lowest for Excel 
running under Windows as it has the lowest hit rate in 
the L1 cache compared to the other applications. CLIP 
on the other hand achieves the highest hit rate in the L1 
cache relative to the rest of the applications and there- 
fore shows the best performance. 


Table 3. Configuration Parameters for the Pentium™ Processor DRAM Subsystem Design Example 


NOTE: 


TS iguration Parameters 
[Ting witebackPagert ———sCi~dtCiee 


Timing is given in CPU clocks; 5-2-2-2 refers to 5 clocks for lead off-cycle and 2 clocks for each successive access of the 
burst cycle. For further details refer to the design example section. 
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Figure 2. Overall Performance of the Pentium™ Processor Memory Subsystem Design Example 


3.2 Impact of Wait States on 
Performance 


This section examines the impact of wait states on over- 
all performance of the Pentium processor. As wait 
states are added to the different read and write cycles, 
the number of clocks required to execute an application 
increases and the overall performance decreases. Three 
sets of simulations were run (one application from each 
operating system environment) in which the timing pa- 
rameters from only one category were changed while 
the other parameters were kept constant. This was done 
so as to analyze the dependency of Pentium processor 
performance on each of the different timing parame- 
ters. The memory subsystem parameters shown in Ta- 
ble 3 were used as the base in the analysis. 
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3.2.1 READ CYCLE PERFORMANCE 


The different memory timing parameters that are of 
interest in analyzing the read cycle performance are: 


® Clock latency on read lead-off cycle (i.e., the num- 
ber of clocks required to complete the first access of 
a burst read cycle or single read cycle). 


® Clock latency on burst read cycles (i.e., the number 
of clocks required to complete the subsequent ac- 
cesses of a burst read cycle). 


Figure 3 shows the effects of burst read cycle timing on 
performance. It shows the difference in performance 
relative to a zero wait system as wait states are added. 
Only the burst read timing parameters are varied while 
the other parameters are kept constant as in Table 3. 
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Figure 3. Effects of Burst Read Cycle Timing on Performance 
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Figure 4. Effects of Read Lead-Off Cycle Timing on Performance 
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Figure 4 shows the effects of read lead-off cycle timing 
on performance. Again only the read lead-off cycle tim- 
ing parameters were varied while the others parameters 
were kept constant in running the simulations. Also 
note that the numbers shown on the x-axis of Figure 4 
correspond to lead-off cycle timing for pipelined page 
hit reads, however in running the simulations, wait 
states were added to lead-off cycle of all read parame- 
ters (i.e., non-pipelined page hit reads, pipelined and 
non-pipelined page miss reads, etc.). 


From Figure 3 and Figure 4 we can notice that, as we 
would expect, the relative performance decreases as 
wait states are added to each of the timing parameters. 
The read burst timing however shows more sensitivity 
to wait states than read lead-off cycle timing. This is 
because every wait state on a burst read normally af- 
fects four transfers and ties up the bus the most, while 
the lead-off timing parameter only affects the timing for 
the first access and therefore has less impact on overall 
bus utilization. Among the three applications, Auto- 
CAD running under DOS showed the least sensitivity 
to wait states (approximately 2% change per wait state 
for burst reads) because of its higher hit ratio in the L1 
cache. Excel under Windows on the other hand was 
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more sensitive (3%-—5% change per wait state for burst 


reads) because it required the external bus more fre- 
quently than the rest. 


3.2.2 WRITE CYCLE PERFORMANCE 


For the write cycle performance, the timing parameters 
of interest are: 


@ Clock latency on write cycle (i.e., the number of 
clocks required to complete a write cycle to memo- 
ry). 

@ Clock latency on burst write cycles (i.e., the number 
of clocks required to complete each subsequent por- 
tion of a burst write (write-back) cycle). 


Figure 5 and Figure 6 show the effects of write cycle 
and burst write cycle timing on performance respective- 
ly. Note in running the simulations, only the individual 
parameters of concern in each case were varied while 
all others were kept constant (as shown in Table 3). 
Also note that in simulating the write cycle perform- 
ance, wait states were added to all the write cycle pa- 
rameters (pipelined and non-pipelined page hit and 
page miss writes), however only the parameters for 
page hit writes are shown in Figure 5. 
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Figure 5. Effects of Write Cycle Timing on Performance 
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Figure 6. Effects of Burst Write Cycle Timing on Performance 


Compared to read cycles, write cycles do not show as 
much sensitivity to wait states. This is mainly attribut- 
ed to the write-back cache architecture of the Pentium 
processor’s internal data cache which reduces the num- 
ber of write cycles coming out on the external bus. It is 
also related to the fact that write cycles get posted to 
the write buffers internal to the Pentium processor, and 
the CPU does not get stalled unless there is another 
miss to L1 before the write buffer is emptied. 


For burst write cycles, an additional wait state hardly 
impacts overall performance. This is attributed to the 
fact that write bursts (due to a read miss in the Ll 
cache) only constitute a small percentage of overall bus 
activity (see Figure 1) and never stall the CPU engine. 
In systems which involve heavy snooping activity, the 


number of write bursts may constitute a larger percent- © 


age of overall bus activity and therefore show a more 
significant impact to overall performance. 
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3.3 Effects of DRAM Interleaving on 
Performance 


Interleaving involves using more than one bank of 
DRAM to effectively increase overall performance, 
with each bank normally having the same width as the 
memory bus. Each bank in the interleaved system is 
controlled separately. Thus, during an access to one 
bank, the other bank is made ready for the next sequen- 
tial access. In this manner, data transfers can be carried 
out from alternate banks to achieve a faster burst trans- 
fer rate and increase the overall performance. Adding 
additional banks to the DRAM array however adds to 
the memory logic and minimum memory requirements 
and therefore involves a cost-performance tradeoff. 
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Figure 7. Effects of DRAM Interleaving on Performance 


Figure 7 shows the effects of the DRAM interleaving 
factor (single bank versus a 2 bank and a 4 bank imple- 
mentation) on the overall performance of the Pentium 
processor. The bus timing assumptions for the different 
interleaved systems are as follows: 


1 Bank— 4 clock burst read timing, 3 clock burst write 
timing. 

2 Bank— 2 clock burst read timing, 2 clock burst write 
timing. 

4 Bank— 1 clock burst read timing, 1 clock burst write 
timing. 


All other parameters remain constant as shown in Ta- 
ble 3. 


The overall performance is compared relative to a zero 
wait state memory system. The results show that the 
single bank suffers as much as 9% loss in relative per- 
formance compared to a 2 bank system. While the 4 
bank system adds 2% -4% to the overall performance 
for different applications over the 2 bank system. Since 
4 bank interleaving adds cost to the memory subsystem 
by requiring additional logic and increasing the mini- 
mum DRAM size, the 2 bank interleaved system is 
therefore the best cost/performance option for the Pen- 
tium processor desktop. 
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3.4 Effects of DRAM Paging on 
Performance 


DRAM paging provides faster accesses for successive 
data transfers that occur within the same page. It is 
typically used for all type of accesses (read, write, 
write-backs) in order to increase the overall perform- 
ance. The assumption here is that the Row Address 
Strobe (RAS) signal remains asserted continuously and 
is deasserted only when there is a page miss. 


Figure 8 shows the effects of DRAM paging on the 
overall performance of the Pentium processor. In this 
figure, a paged DRAM system with an 8K byte and a 
16K byte page size is compared to a non-paged DRAM 
system and the performance comparison is relative to a 
zero wait state memory system. The timing assump- 
tions for the non-paged memory system are as follow- 
ing: 


Burst Read Cycle = 8-2-2-2; 


Burst Write Cycle = 6-2-2-2; 
Write Cycle = 6 clocks 
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Figure 8. Effects of DRAM Paging on Performance 


Table 4 shows the average page hit/miss ratios for each 
of the three applications in Figure 8. The results indi- 
cate that a non-paged system may suffer anywhere from 
3% to 7% loss in performance compared to a paged 
system. This is due to the high page hit ratio for each of 
the applications that were simulated, as shown in Table 
4 (shown for 8 Kbyte page size). However increasing 
the page size from 8 Kbyte to 16 Kbyte had no signifi- 
cant impact on overall performance. 


3.5 Effects of Memory Bus Pipelining 


The Pentium processor supports a bus pipelining fea- 
ture for read and write cycles which allows it to drive 


another cycle before the current one is completed. This 
provides faster timing for back to back accesses and 
results in increased overall performance. 


The simulations to understand the performance gain 
achieved from using this feature were carried out for 
three different types of memory subsystems. The cycle 
timings for these memory subsystems are detailed in 
Table 5. 


Table 4. Average Page Hit/Miss Ratio for Three Applications 


Prefetch Page Hit Ratio 
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Table 5. Memory Assumptions for Pipelining Simulations 


Write Page Hit 


Write Page Miss 


NOTE: 
Timing is given in CPU clocks at 66 MHz 


Figure 9 shows the amount of performance gain 
achieved by using the pipelining feature in each of the 
three different memory systems. The slower the memo- 
ry subsystem, the more it benefits from pipelining mem- 
ory accesses. As seen from Figure 9, the “Slow”? memo- 
ry achieves the highest performance gain, over 8% in- 
crease, with the help of pipelining. A typical system 
may gchieve anywhere from 3% to 6% gain in overall 
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performance by implementing the memory bus pipelin- 
ing feature of the Pentium processor. 


Table 6 shows the average number of bus cycles that 
get pipelined for each of the three different memory 
systems. It also proves the fact that the slower the 
memory subsystem, the higher the chance that cycle 
pipelining will occur. 
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Figure 9. Effects of Memory Bus Pipelining on Performance 
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Table 6. Average Pipelining Ratio for Different Memories 


3.6 Memory Bus Width and Bus 
Speed 


The memory bus width and bus speed also affect the 
memory latency time and can impact overall perform- 
ance. The Pentium processor is designed with a 64-bit 
data bus. Therefore it expects all data transfers to occur 
in 64 bits. Similarly the bus is designed to support a 
frequency of 66 MHz. Reducing the bus speed to 
33 MHz would require extra clocks to complete a bus 
cycle and will increase the bus utilization, thereby re- 
ducing the overall performance. 


5S & 8 
29 & 


75.00% 


70.00% 


Performance Relative to 0 WS memory system (%) 


[. OWSMemory | _FastMemory | Stow Memory 
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Figure 10 shows the impact to overall performance in 
going from a 64-bit bus to a 32-bit bus, while Figure 11 
shows the impact to overall performance in reducing 
the bus frequency from 66 MHz to 33 MHz. The as- 
sumptions for the timing of the memory systems for 32- 
bit bus and 33-MHz bus speed are given in Table 7. For 
comparison with the normal case (i.e., full speed bus, 
64-bit data bus width), the parameters provided earlier 
in Table 3 were used. 2 if 
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Figure 10. Effects of Bus Width on Performance 
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Figure 11. Effects of Bus Speed on Performance 


The simulation results showed that reducing the bus 
width to 32 bits resulted in 9%-—14% loss in perform- 
ance relative to a 0 WS memory system. Reducing the 
bus speed in half also reduced the overall performance 


relative to 0 WS memory to go down by 8%-10%. 
Excel under Windows showed more sensitivity because 
it used the memory bus more often due to the lower L1 
cache hit rate. 


Table 7. Assumptions for Memory Bus Speed and Bus Width Simulations 


Cycle Type Non-Pipelined 
Read Page Hit 4-1-1-1 


Write Page Miss 
Write Burst Page Miss 


4.0 AN EXAMPLE SYSTEM 
ARCHITECTURE OVERVIEW 


The next section will work with an example to discuss 
memory subsystem design for the Pentium processor. 
The block diagram for an example system architecture 
is shown in Figure 12. The assumptions for the system 
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architecture are as follows: It is designed without a sec- 
ond level cache, with the memory subsystem residing 
on the CPU (memory) bus, running at full speed, with a 
64-bit data path to the Pentium processor. The memory 
subsystem portion consists of a two bank interleaved 
DRAM array that supports DRAM paging, memory 
cycle pipelining and CPU write-back operation. In or- 
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der to provide interface to an existing system chip set, 
the bus conversion logic provides the necessary logic 
required to interface to the Intel 82350 EISA chip set as 
shown in the block diagram. The discussion in the fol- 
lowing sections will focus on detailing the memory sub- 


system and the bus conversion logic that is required to 
interface to a 32-bit, 33-MHz bus. 


PENTIUM™ 
PROCESSOR 


AP-478 


5.0 DRAM SUBSYSTEM OVERVIEW 


This section presents a functional overview of the ex- 


ample memory subsystem. A block diagram of the 


memory subsystem is shown in Figure 13. The subsys- 
tem has been designed to support as many functions of 
the CPU as possible, such as, allowing CPU to operate 
in write-back mode; supporting bus pipelining; support 
of burst read and burst write cycles, etc. The other at- 
tributes of the memory subsystem include DRAM pag- 
ing support and two bank interleaving. 
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Figure 12. System Architecture Overview 
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Figure 13. Block Diagram of the Memory Subsystem 


The memory subsystem consists of the following 
blocks: DRAM array, address path logic, page address 
comparison logic, data path logic and control logic. 


Since the design employs the interleaving technique for 
faster accesses during burst transfers, it alternates be- 
tween the 64-bit banks. The particular bank that is ac- 
cessed is determined by the value of the address line 
A3. Even data (A3 = 0) is stored in bank 0, while odd 
data (A3 = 1) is stored in bank 1. During burst trans- 
fers, cycles are overlapped in order to hide the memory 
latency for subsequent accesses and in this manner 2 
clock timing is achieved for burst transfers. 


In order to support the address pipelining feature of the 
Pentium processor, address latches are used to latch the 
Pentium processor address as soon as a bus cycle is 
started. Once the address is latched, the control logic 
asserts the NA # signal to allow the Pentium processor 
to issue another cycle before the previous cycle gets 
completed. This effectively reduces the lead-off cycle 
time by 2 to 3 clocks for back to back transfers and 
improves overall performance. 


Multiplexor logic is used to drive the appropriate row 
and column addresses to the DRAMs. Separate address 
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multiplexors are required for each bank to support the 
interleaving function as well provide the necessary 
drive required for each bank. 


The control logic in this example design is implemented 
with EPLDs. Around 16 programmable logic devices 
are employed for the DRAM controller function. This 
includes the RAS (Row Address Strobe), CAS (Col- 
umn address Strobe), WE (Write Enable) generation, 
along with Decode logic for DRAM cycle control; in 
addition to CPU Cycle Control, Data and Address 
Path Control, Burst Address Generation, Page Hit 
comparison and Row Decode functions. 


For the data path logic, separate registered transceivers 
are used for each DRAM bank in order to transfer the 
data between the CPU and the DRAMs. These data 
transceivers are enabled/disabled appropriately to 
switch between bank 0 and bank 1 during bursting. The 
registered function of these transceivers is used for sup- 
porting the pipeline operation for write cycles. Write 
data coming out of the CPU is latched in these registers 
and then written to DRAMs while the CPU is allowed 
to issue another cycle. In this manner, write cycles can 
complete in three clocks and incur additional wait 
states only if there is a page miss. 
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Details on each of the blocks that make up the DRAM 
subsystem will be covered in subsequent sections. 


5.1 DRAM Organization 


The DRAM array is organized in two banks of 64-bit, 
interleaved memory. Each bank is two rows deep 
(Rows A and B). Figure 14 shows how the DRAM 
array is organized. 


Each of the two banks require 60 ns Fast Page Mode 
DRAMs in single density 256K x 36 or double density 
512K x 36 memory module configurations. The mini- 
mum memory configuration for this memory subsystem 
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512K x 36-60 


A0-A8 


AP-478 


is 4 Mbytes using four 256K x 36 memory modules (2 
modules per bank); and the maximum is 16 Mbytes 
using eight 512K x 36 memory modules (4 modules per 
bank). With a few restrictions a combination of single 
and double density modules is also permitted. 


The DRAM control signals, RAS#, CAS#, WE#, 
and address lines are distributed according to the par- 
ticular bank, the row within the bank, and the loading 
on these signals, There are a total of 16 CAS# signals, 
8 RAS # signals and 4 WE# signals generated to cover 
the entire DRAM array. The address lines are distrib- 
uted such that each single line covers only one row per 
bank. 
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Figure 14. Organization of the DRAM Array © 
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Figure 15. Address Path Logic 


5.2 Address Path Logic 


The memory subsystem is designed to support the bus 
pipelining feature of the Pentium processor. In order to 
allow the CPU to issue the next cycle before the previ- 
ous gets completed, the DRAM subsystem requires 
latching the CPU address, byte enables and other cycle 
definition signals. The latching function is implemented 
with four 74FCT573 octal transparent latches. The 
latch enable input on these latches is controlled by the 
ALE signal, which is generated by the control logic 
(PLD 13). When ALE is high, the latch appears trans- 
parent. When ALE goes low at the start of a bus cycle, 
the address and other control signals get latched and 
remain latched until the cycle is complete. This pre- 
vents the signals corresponding to the new cycle from 
reaching the multiplexors and affecting the cycle in 
progress. 


As shown in Figure 15, the row and column address 
corresponding to the memory bus cycle, flow through 
the transparent latches and reach the address multiple- 
xors for each bank. Four 74F257 quad 2 to 1 multiple- 
xors are used for driving the appropriate row and col- 
umn addresses to the DRAMs. The multiplexors are 
separate for each bank, which is required for the inter- 
leaving function. 
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The select function for the multiplexors is controlled by 
the Row Select signal. Whenever Row Select goes low 
(i.e., when RAS# goes active, and after the row ad- 
dress hold time is satisfied, the address path to the 
DRAMs switches from the row address to the column 
address). Anytime that a DRAM page miss is encoun- 
tered, RAS# is deasserted and the Row Select signal 
switches the address multiplexors to allow the new row 
address to flow through to the DRAMs. 


Each address line coming out of the multiplexor then 
flow through buffers (74F244) and is duplicated in or- 
der to separately drive each of the different rows within 


~ each bank of the DRAM array. This is done so as to 


reduce the amount of loading on each of the DRAM 
address lines. 


To support burst data transfers, the memory subsystem 
generates the burst address corresponding to Pentium 
processor’s burst order. The Pentium processor only 
drives the first address of a burst cycle, while the mem- 
ory subsystem generates the addresses for the remain- 
ing transfers of the burst. In this design, the control 
logic (PLD 14) generates the burst address and drives 
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the lowest address line to the DRAMs. This PLD also 
implements the multiplexing function for address line 
AO to the DRAMs such that AO corresponds to CPU 
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Figure 16. DRAM Page Address Comparison Logic 


Figure 16 shows the DRAM page address comparison 
logic. This logic compares the address of the current 
cycle with that of the previous cycle and generates a 
page hit signal. If the current cycle’s address is to the 
same DRAM row (page) as the previous cycle, then the 
page hit signal is activated. In case of a page miss, the 
control logic, in response to the page hit signal being 
inactive, deasserts RAS # to satisfy the RAS precharge 
time and also supplies a new row address to the 
DRAMs. For back to back page hit cycles, RAS# re- 
mains active and thereby avoids the min. RAS# to 
CAS # active delay for each cycle. 


An 8-bit identity comparator, 74FCT521B, is used for 
comparing the address lines A13—A20. The address 
lines A21—A23 are compared using a PLD. This PLD 
also checks whether the current row matches the previ- 


PRELIMINARY 


ous row.and bank and finally combines all the separate 
compare output signals to generate the DRAM page hit 
signal. 


The page hit detect PLD (PLD 9) also performs the 
row decode function. The jumpers MSIZA and 
MSIZB, as shown in Figure 16, set the appropriate 
memory size depending on how the DRAM banks are 
populated. MSIZA corresponds to row A, while 
MSIZB corresponds to row B. If MSIZA is set to zero, 
then row A is set for 256K x 36 bit modules, while 
MSIZA equal to 1 sets row A in each bank for 512K x 
36-bit modules. The information provided by these 
jumper settings along with the status of address lines 
A22 and A23 determine the particular row that is ac- 
cessed and is subsequently used by the RAS decode 
logic to strobe the appropriate row. 
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5.3 Data Path Logic 


The data path logic (including data parity) for the 
memory subsystem is implemented with 74FCT646A 
bidirectional registered transceivers, as shown in Figure 
17. Since each component is 8 bits in width, a total of 
nine chips are required to support the 64-bit data path 
and parity for each bank in the DRAM array. 


The registered function of the the ’646A parts is re- 
quired to interleave and pipeline write cycles. The write 
data from the CPU is held in the registers for each bank 
while the next cycle starts. In this manner three clock 
timings can be achieved for single write cycles and two 
clock timings for subsequent transfers of a burst write. 
See memory cycle timing section for further details. 


The direction control for the transceivers is controlled 
by the W/R # signal. The other controls, output enable 
(G#) and clock pulse inputs (CP), are controlled by 
the DENO/1 and DCP0/1 signal respectively which are 
generated by the control logic. DEN enables the out- 
puts on the transceivers for each bank appropriately 
during read and write cycles. A low to high transition 
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on the DCP signal allows the write data to be stored in 
the registers during write cycles. 


During burst read cycles, data transfer is alternated be- 
tween the two banks. As separate data path logic is 
used for each bank, the data logic for one bank must be 
tristated during access to the other bank. In order to 
minimize and prevent bus contention during burst read 
cycles, the control PLD for DEN (PLD 10) does not 
enable the outputs of one bank and disable the outputs 
for the other bank at the same time. Even though the 
output disable time for the ’646A parts is faster than 
the enable time, the control logic introduces a slight 
delay in the output enable signal in order for the out- 
puts of the other bank to be disabled to avoid conten- 
tion. 


An alternate method for implementing the data path 
logic is to use fast 2 to 1 multiplexors to switch between 
bank O and bank 1 data. This however increases the 
component count as separate registers are required for 
each bank to hold the write data in order to support the 
interleaving and pipelining functions for write cycles. 
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Figure 17. Data Path Logic 


5.4 Control Logic 


In this design, the logic responsible for generating all — 


the signals necessary to control the memory subsystem 
is implemented in PLDs. Figure 18 shows how the con- 
trol logic is distributed. 


The first level of logic is made up of the memory bus 
cycle control for the CPU. The first level samples the 
cycle definition signals from the CPU and generates the 
appropriate signals to the second level of logic for 
DRAM cycle control. The second level therefore waits 
for the cycle start signal from the first level before initi- 
ating a DRAM cycle. Distributing the logic in this 
manner reduces the amount of loading on the CPU 
cycle definition signals, especially ADS #. It also means 
that the decode outputs (such as DRAM Chip Select 
signal and Page Hit signal) have an extra clock before 
they are sampled by the second level PLDs, therefore 
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more decode time is allowed. The first level logic is also 
responsible for generating the address latch enable sig- 
nal and NA # signal back to the CPU for cycle pipelin- 
ing. 


The second level of logic generates the master signals 
such as RAS# and CAS# required for the DRAM 
access, in addition to signals that control the data trans- 
fer between the DRAM subsystem and the memory 
bus. As shown in Figure 18, one PLD generates the 
RAS# signal while one PLD each is dedicated to gen- 
erating the CAS# and Write Active indication signals 
for each bank. 


The RAS# Generation PLD works in conjunction 
with the RAS# decode PLD, which also implements 
the RAS# Precharge Count function, to keep RAS# 
continuously active for all CPU cycles as long as a page 
miss is not encountered, a RESET does not occur, or 
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the memory bus is not relinquished in response to a 
HOLD request. Whenever another bus master (EISA, 
ISA, DMA) requests a DRAM transfer, the processor 
is put on HOLD and RAS# is deasserted. RAS# is 
asserted for the non-CPU generated cycle appropriately 
and deasserted on completion of that cycle. RAS# is 
then asserted again as soon as the CPU is given control 
of the bus and a DRAM transfer is initiated. Refresh in 
this design is implemented as a “RAS Only Refresh.” 
Every time the integrated system peripheral (ISP) gen- 
erates a refresh request for the DRAM (once every 15 
ps), RAS# is asserted without activating the CAS# 
signal. The refresh cycle is run as a standard EISA 
cycle. 
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The CAS# generation PLDs control the strobing of 
the CAS# signal to the DRAMs in each bank for read 
and write cycles. During burst transfers, data accesses 
are alternated between the two banks and the CAS# 
signals are asserted and deasserted appropriately by this 
logic. During write cycles, due to pipelined implemen- 
tation, the Write Active indication signal stays active as 
long as the corresponding CAS # signal is active to pre- 
vent any new cycle to the same bank from interfering. 


The second level logic also includes a PLD to generate 
the CPU BRDY # signal. Separate BRDY # signals are 
generated for read and write cycles. These signals are 


| combined with the system RDY# signal by a PLD in 


the third level before being input to the CPU. Other 
PLDs that make up the second level generate the Bank 
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Figure 18. DRAM Subsystem Control Logic 
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Select and Burst Count Tracking signals along with the 
data tranceiver control signals for data path control. 


The third level of logic is mainly combinatorial logic. 
This includes the Row Decode, RAS# Decode, CAS # 
decode and Write Enable Generation signals for each 
bank, and BRDY # combine logic. During write cycles, 
the CAS # decode logic qualifies the CAS # signal with 


the byte enable signals such that only those bytes for - 


which the corresponding byte enable signals are assert- 
ed are written into. 


For the Address Decode function, an 8K x 8 SRAM is 
used. This SRAM performs the DRAM Chip Select, 
Write Protect, and Cacheable Address Decode func- 
tions. Its implementation and programming is dis- 
cussed in reference [1] (Jntel486™ Microprocessor- 
Based 82350 EISA Design Guide). The only difference 
in this design is that a faster SRAM chip is used, 10 ns 
instead of 15 ns, to allow the decode outputs to be 
available for sampling at the end of the second clock of 
the CPU bus cycle. 


6.0 BUS CONVERSION LOGIC 


In this example design, the memory subsystem resides 
on the CPU bus, running at full speed (66 MHz) with a 
64-bit data path interface to the CPU. This section will 
examine the address/data bus conversion logic required 
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to interface this memory subsystem with the 82350 
EISA chip set which is designed for a 33-MHz host bus 
interface and 32-bit data path. The address/data con- 
version logic is shown in Figure 19. It consists of Ad- 
dress Buffer logic, Data Bus Buffer logic, Data Parity 
Translate and Byte Enable Translate logic. 


The address buffers are required to isolate the CPU 
address bus from the 82350 host address bus during 
CPU writeback cycles. Four 74F645 buffers are used 
for the 32-bit address bus. During EISA, ISA, DMA 
master activity on the memory bus, the CPU is snooped 
for every read or write cycle to DRAM. In case of a 
snoop hit to a modified line (which would require a 
writeback cycle), the HITM# signal goes active and 
the output buffers on the ’645 chips are turned off to 
allow the CPU to drive the address for the writeback 
cycle on the memory bus. In all other cases, the outputs 
on these buffer chips stay turned on. 


For the 64-bit to 32-bit data path interface, two EBB 
(EISA Bus Buffer) chips strapped in “data without par- 
ity” mode are used. This logic can alternatively be im- 
plemented with discrete logic using 74F543 and 
74ALS8245 chips. However the use of EBB chips reduc- 
es the overall chip count and board space as each EBB 
chip is able to provide a 32-bit interface. 
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Figure 19. Address/Data Bus Conversion Logic 
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The controls for these data buffer chips is provided by 
the EDOE# and HDOE# signals. The HDOE# sig- 
nal activates the output buffers on the CPU bus side 
during CPU I/O read cycles and write cycles generated 
by other bus masters (EISA/ISA/DMA) in the system. 
In case of a non-CPU generated write cycle to DRAM, 
the output enables on both high and low (Dwords) 
EBBs are enabled simultaneously and exactly the same 
data is duplicated at the outputs and appears on the 
CPU bus side. It is the byte enable signals that control 
which byte within the 64-bit word gets written into the 
DRAM array. For CPU generated I/O write cycles 
and EISA/ISA/DMA master generated read cycles to 
DRAM, the output buffers on the host bus side are 
enabled by the EDOE# signals. Depending on the 
status of the HA2 (Host Address line 2) signal, either 
the high 32-bits or the low 32-bits from the 64-bit bus 
appear on the 32-bit host data bus. 


For the Data Parity and Byte Enable translation be- 
tween the 64-bit bus and the 32-bit bus, two 85C060 
PLDs are used. Each of these chips is designed with 16 
I/O pins and therefore convenient to use for this pur- 
pose. The Byte Enable translation PLD converts the 8- 
byte Enables corresponding to a 64-bit bus appropriate- 
ly into 4 Byte Enables corresponding to a 32-bit bus 
and vice versa. Similarly the Data Parity translate PLD 
does the same for the Data Parity signals. The output 
enables for the I/O pins on these chips is controlled by 
the HLDA and HITM # signals from the CPU. During 
CPU generated I/O cycles, identified by HLDA inac- 
tive, the output buffers on the 82350 host bus side are 
enabled. During all other cycles, except CPU writeback 
cycles to DRAM, the output buffers on the CPU bus 
side are enabled. 
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6.1 Synchronization of Control 
Signals 


The synchronization of signals between the Pentium 
processor (66 MHz) and the EISA bus controller 
(33 MHz) are performed by two PLDs. One of the 
PLDs takes some of the CPU output signals as inputs 
and regenerates these for the 33-MHz bus. The other 
PLD synchronizes the input control signals going to 
the CPU from the 33-MHz bus. 


Two of the Pentium processor’s output signals, ADS#, 
and PCHK *#, are available for only one clock coming 
out of the CPU at 66 MHz. These have to regenerate 
for the 33-MHz bus in order for the bus controller to 
sample them. Figure 20 shows how the ADS # signal is 
synchronized and regenerated for the 33-MHz bus. If 
the ADS# signal comes out relative to the falling edge 
of the 33-MHz clock (label A), the output HADS# 
gets sampled at the next rising edge of MCLK (ie., 
next rising edge of CPU clock). However, if ADS# is 
output relative to the rising edge of MCLK (label B), 
the next rising edge of MCLK coinciding with rising 
edge of CPU clock happens two CPU clocks later as 
shown. The synchronization of the PCHK # signal is 
performed identical to the ADS# signal. 


The other PLD handles the synchronization of CPU 
input signals like EADS#, HOLD, FLUSH# and 
BRDY # coming from the EBC before they are input 
to the Pentium processor. 


7.0 MEMORY CYCLE TIMING 


This section details the cycle timing for the example 
memory subsystem. Different memory bus cycles will 
be illustrated here to explain the function of the memo- 
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Figure 20. Synchronization of ADS# from 66-MHz Bus to 33-MHz Bus 
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ry control logic. Note that the bus cycles shown in this 
section are specific to this example and are not restrict- 
ed to the timing provided here. Please refer to the Pen- 


tium™ Processor Data Book, for the description and 
timing of the CPU specific signals. 


Table 8 details the clock timing for different type of 
read and write cycles that apply to this memory subsys- 
tem. These will be explained in subsequent sections. 
Note, the numbers shown indicate the actual number of 
CPU clocks that would be required to complete the bus 
cycles. For example 5-2-2-2 refers to five clocks for the 


first access and two clocks for each subsequent access | 


of the burst. 


7.1 Read Cycle Timing 


7.1.1 NON-PIPELINED READ CYCLE AFTER 
RESET 


Figure 21 shows a CPU generated read cycle to DRAM 
‘that is started after a RESET or DRAM Refresh. In 
this type of cycle, the DRAM access is started in the 
third clock with the activation of the RAS# signal. 
Since this memory subsystem is designed to support 
paging, RAS# is normally kept active unless there is a 
page miss to the DRAMs (explained later). The actual 
number of clocks that are required to complete the 
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DRAM access from the assertion of the RAS# signal 
are dictated by the DRAM timings. In this design 60-ns 
DRAMs are used therefore 6 clocks are required to 
complete the first data access from RAS # for a total of 
8 clocks to complete the first data access of the bus 
cycle (from ADS# to the first BRDY #). 


For the read cycle, the memory controller functions as 
follows. The CPU cycle control logic samples the 
ADS # signal along with other cycle definition signals 
at the start of the second bus cycle and generates a 
DRAM cycle in progress signal. This signal along with 
the DRAM chip select signal from the address decode 
logic is input to the actual RAS# and CAS*# signal 
generation PLDs to be sampled at the start of the third 
bus cycle. The page hit/miss signal is also sampled in 
this clock, but since this cycle started with RAS # inac- 
tive, the page hit signal is not factored in and the 
DRAM cycle is initiated by the assertion of RAS#. 


The KEN # (cache enable) signal is available from the 
decode logic in the second clock. After it is determined 
that the bus cycle is a valid DRAM cycle, the KEN # 
signal is asserted to the CPU to start the cache line fill. 
This is done in the third clock since the KEN# signal 
is sampled by the CPU in the same clock in which the 
first BRDY # (or NA#) is asserted. The KEN # signal 
then stays active until the end of the cycle. 


Table 8. Cycle Timing for the Example Memory Subsystem 
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Figure 21. DRAM Read Cycle after RESET 


The address logic makes the new row address available 
to the DRAM array for both banks in the first T2 of 
the bus cycle before activation of the RAS# signal. The 
Row Select signal then switches the address multiple- 
xors for each bank to the column address path, after the 
row address hold time is satisfied. 


Since interleaving is implemented in this design, the 
CAS# signals for both banks are asserted simulta- 
neously in the sixth clock after satisfying the minimum 
RAS# to CAS# active delay. In the bus cycle shown, 
the first access is made from bank 0. The BRDY # 
signal to complete the first transfer is made three clocks 
after the CASO# signal. The three clock timing is re- 
quired to meet the CAS# to data active delay for the 
DRAMs as well as the propogation delay through the 
data transceivers. Once the first transfer is complete, 
the data path switches to bank | to complete the data 
access for the second burst transfer. The CAS# signal 
for bank O is precharged during this time to start 
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the third access. In this manner, the BRDY# signal 
can be asserted every other clock to achieve the two 
clock timing for each successive access of the burst. 


7.1.2 PIPELINED READ PAGE HIT CYCLE 


Figure 22 shows a burst read page hit cycle which is 
pipelined into another burst read cycle. As explained in 
Section 5.2, the RAS# signal remains asserted for cy- 
cles which occur to the same DRAM page. In this case, 
the page hit signal available from the page comparison 
logic in the second clock of the bus cycle determines 
whether to deassert or keep RAS# asserted. If RAS# 
stays asserted, the CAS# signals can be asserted in the 
2nd T2 or third clock of the bus cycle. With a three 
clock CAS# to data ready delay, the lead-off cycle for 
the burst read transfer on a page hit therefore requires 
five CPU clocks. 
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Figure 22. Pipelined Burst Read Page Hit Cycle 


With pipelined implementation, the next cycle to the 
memory subsystem is allowed to start before the previ- 
ous cycle is completely finished. As shown in the figure, 
the ADS# signal for memory cycle B is issued by the 
CPU during memory cycle A. This is in response to the 
NA# (Next Address) signal being active two clocks 
earlier. The way the memory subsystem functions dur- 
ing pipelining is that it latches the CPU address and 
cycle definition information for every cycle. The infor- 
mation gets latched in the second clock of the bus cycle 
with the ALE signal (generated by address latch enable 
PLD) going active. The information is kept latched un- 
til the end of the cycle. When the CPU issues the next 
cycle in response to NA#, the DRAM subsystem is 
able to perform the address decoding and page address 
comparison for the next cycle while the previous cycle 
completes. This allows the DRAM transfer correspond- 
ing to the pipelined access to be initiated as soon as the 
last BRDY # for the previous bus cycle is activated. 
The result is that the first transfer (B1) can complete in 
four CPU clocks. Each of the subsequent transfers of 
the burst cycle take 2 clocks. This effectively saves two 
clocks over the non-pipelined case (i.e., one clock for 
the idle clock that occurs between the last BRDY # 
and next ADS#, and one clock to issue the ADS #). 
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Figure 22 also shows the switching of the data trans- 
ceivers during burst read cycles. DENO is low during 
data access from DRAM bank 0 while DEN! is low 
during data access from DRAM bank 1. The enabling/ 
disabling of the transceivers is handled by the data 
transceiver enable PLD. Note that the data access A4, 
for memory cycle A and data access B1, for memory 
cycle B occur to the same bank. This requires a mini- 
mum of four clocks, 1 clock to precharge CAS#, and 
three more clocks to complete the data transfer. 


7.1.3 PIPELINED READ PAGE MISS CYCLE 


A burst read page miss cycle is illustrated in Figure 23. 
The memory cycle is shown pipelined into a previous 
burst read cycle B. The difference here is the behavior 
of the RAS# signal. Since cycle C occurs to a different 
DRAM page than cycle B (determined by the page hit 
signal), the RAS# signal is deasserted as soon as cycle 
B completes. This allows the new row address to flow 
through to the DRAMs. The timing for the page miss 
cycle adds three clocks for the RAS# precharge time 
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as well as the minimum RAS# to CAS# active delay. 
Therefore the first transfer for the page miss cycle can 
complete in nine clocks. The pipelined implementation 
can effectively save three clocks over the non-pipelined 
implementation (i.e., 1 clock between the idle clock that 
occurs between the last BRDY # and the next ADS#, 
and two clocks that occur after a cycle is started but 
before RAS# is deactivated for the precharge). 


7.2 Write Cycle Timing 


7.2.1 PIPELINED WRITE CYCLES 


In this example design, memory bus pipelining is imple- 
mented for both read as well as write cycles. Figure 24 
~ shows the timing of pipelined write cycles. Memory cy- 
cles A and C are page hits while memory cycle B is a 
page miss. 
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As mentioned earlier in the memory subsystem over- 
view section, the data logic is implemented with regis- 
ters to hold the CPU write data while the next cycle 
starts. This results in three clock timing for page hit 
writes for back to back writes cycles. Cycles that don’t 
get pipelined but are page hits can also complete in 
three clocks (from ADS# to BRDY #) because of the 
register function. However, in this case there is an idle 
clock in between the time that the CPU samples 
BRDY # and issues the next ADS#. 


For write cycles, the NA# signal for pipelining is as- 
serted in the clock immediately after the clock in which 
ADS # is asserted. This is to ensure that for page hit 
writes the ADS# for the next cycle comes out in the 
clock immediately after BRDY # for the previous cycle 
without an idle clock. The data for the write cycle gets 
driven out by the CPU in the second clock of the bus 
cycle. In the third clock, the BRDY*# signal gets as- 
serted by the controller to complete the memory write 
cycle. It is in this clock that the control logic activates 
the write data latch signal for the appropriate bank to 
latch the data in the registers. In this clock the data 
also flows through to the DRAM array as well as 
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Figure 23. Pipelined Read Page Miss Cycle 
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CAS# signal is being precharged from the previous 
write. The actual write to DRAM occurs in the fourth 
clock. It takes one more clock for a total of five clocks 
to complete the write transfer to DRAM. However, due 


to pipelining, cycles are overlapped and therefore to the 
CPU it appears as a three clock write cycle. 


For page miss write cycles, however, the BRDY # acti- 
vation to the CPU is delayed four more clocks to allow 
for the RAS# precharge and RAS# to CAS# active 
delay for write cycles. As shown in Figure 24, cycle B is 
a page miss; therefore, the page hit signal goes inactive 
in the second clock. In response to page hit signal inac- 
tive, RAS# is deasserted in the third clock and kept 
inactive for three CPU clocks to meet the RAS# pre- 
charge time. The DCP! signal, to latch the CPU write 
data, is activated in the third clock and remains active 
until the data actually gets written to the DRAMs. 
Since the page miss write cycle is pipelined into a prev- 
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ious write, the cycle is overlapped with the previous 
cycle and completes in seven clocks to the CPU. 


7.2.2 BURST WRITE CYCLES 


Figure 25 shows the timing for a burst write cycle. The 
Pentium processor supports burst writes only for write- 
back cycles. The timing for the first transfer of the 
write cycle is identical to that shown in Figure 24. Since 
the DRAM array is two-bank interleaved, subsequent 
transfers can complete in two clocks. A restriction 
placed by the Pentium processor for burst write cycles 
is that they cannot be pipelined into other cycles and 
other cycles are not pipelined into them. The other 
thing to note for burst writes is that the data transfer 
always starts with address line 3 being zero (i.e., first 
transfer always occurs to bank 0). 
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Figure 24. Pipelined Write Cycles 
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Figure 25. Burst Write Cycle 


As shown in Figure 25, the CAS# signals correspond- 
ing to each bank are asserted in an alternate fashion. 
Since the first write transfer occurs to bank 0, CASO# 
is asserted first after data for that transfer is latched in 
the registers for bank 0. Then CASO# is deasserted for 
one clock to allow for CAS# precharge. During this 
time data is being written to bank 1. Notice that 
CAS1# is kept deasserted for two clocks before being 
activated for the last write transfer (A4). This is to al- 
low enough data setup time to the DRAMs before the 
CAS# signal is activated. 
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7.3 Pipelined Read-Write Cycles 


Figure 26 shows a write cycle pipelined into a burst 
read cycle. Since the write cycle is a page hit, its timing 
is similar to that described in Figure 24 (i.e., 3 clocks). 
Notice the dead clock immediately following the last 
BRDY # of the burst read cycle. This allows the Penti- 
um processor to turn around the data buffers from be- 
ing set as inputs (for read) to outputs (for writes). The 
write data gets driven out in the clock right after the 
dead clock. The BRDY # to complete the write cycle to 
CPU then occurs in the clock following that; however, 
it still takes two more clocks to complete the write cy- 
cle to DRAM. 
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Figure 26. Write Cycle Pipelined into a Burst Read Cycle 


7.4 Pipelined Write-Read Cycles 


Figure 27 shows a burst read cycle (cycle B) pipelined 
into a write cycle (cycle A). In this case, two extra 
clocks are added to the lead-off cycle for the pipelined 
burst read (i.e., it takes 6 clocks instead of 4 clocks). 
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The reason is that a write cycle requires two more 
clocks to complete at the DRAM after the BRDY # is 
returned to the CPU as explained earlier. If the burst 
read was a page miss cycle, the number of clocks re- 
quired for the lead-off cycle would change from 9 
clocks to 11 clocks. 
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Figure 27. Burst Read Cycle Pipelined into Write Cycle 


7.5 Snoop Cycles 


Snoop cycles are initiated to the CPU in response to 
memory bus activity generated by other bus masters in 
the system such as EISA, ISA, or DMA. In this exam- 
ple the processor is always on HOLD whenever anoth- 
er master is accessing the memory subsystem. The 
AHOLD signal is not required, which is typically used 
for running inquire cycles. As soon as EISA/ISA/ 
DMA activity is detected on the processor bus, the 
EADS # signal is generated to the CPU to initiate the 
snooping. For EISA/ISA/DMA write cycles to 
DRAM, the EADS# signal is generated by the EISA 
Bus Controller. However, for read cycles, the EISA/ 
ISA cycle tracking logic (as part of the memory con- 
troller) generates the EADS# signal appropriately. 
The snoop address also does not require any special 
handling since all address lines are reflected on the ad- 
dress bus during this time. 


Figure 28 shows a CPU writeback due to a snoop hit to 
a modified cache line during an EISA read cycle. As 
soon as the EISA cycle start is detected, the EADS # 
signal is generated to the CPU. At the same time, the 
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EXRDY # signal is also asserted. The EXRDY # sig- 
nal drives the output enable of an ’F125 buffer. Since 
the EISA control signal, EXRDY is open collector, the 
assertion of EXRDY# pulls the EXRDY signal low. 
This is done to add wait states to the EISA cycle in case 
of a snoop hit. 


In response to the EADS# signal, the CPU generates 
the HITM # signal after two clocks to indicate that the 
snoop has hit a modified line and the CPU needs to 
issue a writeback cycle. If the HITM*# signal is not 
asserted after two clocks, the EXRDY # signal is deas- 
serted to allow the EISA cycle to continue without wait 
states. 


When the control logic detects the HITM# signal ac- 
tive, the HOLD signal is deasserted to the CPU to al- 
low the CPU to writeback the cache line. The ADS # 
for the writeback cycle comes out as soon as HLDA is 
deasserted. The deactivation of HLDA also causes the 
output buffers on the address and data bus conversion 
logic (CPU bus to 82350 Host Bus) to turn off to allow 
the CPU to drive the address and data bus. 
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Figure 28. Snoop Cycle—CPU Writeback during EISA Cycle 


As soon as the CPU starts the writeback cycle, HOLD 
is asserted back to the CPU in order to prevent the 
CPU from issuing another cycle immediately after the 
writeback completes and to allow the EISA read cycle 
to complete. Once the CPU has written back the cache 
line, it asserts the HLDA signal. The EXRDY # signal 
is also deasserted in order for the EISA read cycle to 
continue. 


The same sequence is followed during an EISA write 
cycle to memory except that the INV signal is generat- 
ed to the CPU along with the EADS# signal. This is 
done in order to invalidate the cache line after it has 
been written back to memory. During EISA/ISA read 
cycles, the INV signal is kept inactive to prevent the 
line from getting invalidated. In this case, only the final 
state of the cache line is affected (i.e., it is left in the 
shared state). 


Snoop cycles that are run in response to ISA activity on 
the memory bus follow the same pattern as EISA cy- 
cles. The CHRDY*#¥ signal in this case is asserted to 
drive the ISA control signal, CHRDY, low to add wait 
states to the ISA cycle in case of a snoop hit. 
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8.0 CRITICAL TIMING 
CONSIDERATIONS 


The control logic for the example memory subsystem is 
designed to operate synchronously with the CPU clock. 
At the 66-MHz clock frequency, the timing of some of 
the signal paths become critical. These critical timing 
requirements must be met for synchronous operation at 
66 MHz. The other timing constraints are dependent 
on DRAM access timings. These constrain the timing 
of the memory subsystem to a minimum number of 
wait states. 


8.1 DRAM Access Timing 


The DRAM access timing allowed by the memory con- 
troller logic is shown in Table 9 (Note: 60-ns DRAMs 
are used in this design): 
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Table 9. DRAM Access Timings 


The available margin provided in Table 9 is the time 
allowed for loading delay and clock skew etc. The as- 
sumptions for other timing parameters used in the cal- 
culations are as follows (Note: all timings are worst 
case): 


Max. RAS# activation and propagation delay through 
decode and buffer logic 
= 16.7 ns 


Max. CAS# activation and propagation delay through 
decode logic 
= 14ns 


Max. Data Transceiver propagation delay 
= 6.3 ns 


Max. Address activation (from Pentium processor) and 
propagation delay through transparent latch, multiple- 
xor and buffer logic 

= 2/.2 ns 


Min. Pentium processor Data Setup Time 
= 4ns 


Another timing parameter tasc (Column Address Set- 
up Time to CAS# active) becomes critical during pipe- 
lined page hit reads (see Section 7.0, Figure 22). In 
order to support 4 clock timing for the first access on a 
pipelined page hit read, the CAS # signal is activated in 
the second clock of the bus cycle (after the previous 
cycle completes). The column address for the first ac- 
cess must therefore be available before CAS# is acti- 
vated in order to initiate the DRAM access. To support 
this, the memory controller deasserts the ALE signal in 
the last clock of the previous bus cycle to allow enough 
time for the new address to flow through to the address 
multiplexors for the next bus cycle. The timing calcula- 
tion for this parameter is as follows: 


Max. ALE deactivation delay (through PLD 13) 
= 6.5 ns, 


Max. Address propagation delay through transparent 
latch, multiplexor and buffer logic 
= 24.2 ns, 
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Timing “tae 
Data access time from RAS # active 


Data access time from column address valid 


Data Access Time Available Margin 


Min. CAS# activation and propagation delay through 
decode logic 


= 30 ns + 6 ns — 24.2 ns —'6.5 ns 
= 5.3 ns 


The available margin is therefore 5.3 ns. This is possible 
due to the deactivation of the ALE signal one clock 
earlier (i.e., before the completion of the current cycle 
in progress). 


8.2 Controller Logic Timing 


The control logic in this design is implemented in Pro- 
grammable Logic Devices (PLDs) as a distributed state 
machine. Some of the functions of the memory control 
logic involve critical signal paths at 66 MHz. These 
include the PLDs that directly interface with the Penti- 
um processor to generate control signals for the 
DRAM subsystem or involved in input signal paths to 
the CPU. The other functions that require faster timing 
include page hit comparison, RAS# decode and data 
path switching. The logic for all other functions in the 
memory subsystem is implemented with 7.5-ns rated 
programmable logic. — 


There are four PLDs involved in directly sampling sig- 
nals generated by the Pentium processor. The signals 
ADS #, CACHE#, W/R#, and PCHK # must meet 
the minimum setup time requirement for the PLDs that 
sample them in the clock in which they are generated 
(Note: Maximum clock to output valid delay including 
signal flight time for Pentium processor generated sig- 
nals plus minimum setup time for PLDs that sample 
them and change state at the clock edge must be less 
than the time between clock edges (i.e, 15 ns @ 
66 MHz).) Since this particular timing cannot be satis- 
fied by 7.5 ns PLDs, the minimum setup time require- 
ments for PLD 1 (CPU Cycle Tracking Logic), PLD 13 
(ALE Generation), PLD 15 (NA# generation) and 
PLD 22 (Synchronize Pentium processor signals for 
33-MHz EBC) can be met by using 5-ns programmable 
logic devices. 
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The PLDs involved in input signal paths to the CPU 
include PLD 8 (BRDY# generate for read/write cy- 
cles), PLD 23 (BRDY # combine) and PLD21 (Gener- 
ates EADS#, FLUSH#, HOLD to Pentium proces- 
sor) which require faster propogation delay (tpp) or 
faster clock to output delay (tco) in order to satisfy the 
minimum input setup time requirements for the Penti- 
um processor. The margin available on the BRDY # 
input to the CPU using 5-ns programmable devices is 
as follows: 


Margin available on BRDY # 
= (1 CLK) 
— (Time to generate and combine BRDY #) 
— (Minimum Pentium processor BRDY # 
Setup Time) 
= 15 ns (4ns + 5 ns) 4ns 
= 2 ns 
For page address comparison, PLD 9 (Page hit detect 
and row decode) generates the page hit signal which 
must be available for sampling by the PLDs in the sec- 
ond level logic by the end of the second clock of the bus 
cycle. The margin available for signal flight time (for 
CPU address lines) and clock skew on the page hit sig- 
nal is as follows (see Section 5.2 for description of page 
address comparison logic): 


Margin on Page Hit signal 
= 2 CLKs 
— (Maximum Pentium processor Address Valid 
Delay) 
— (521B Comparator Propagation Delay) 
— (Maximum Prop. Delay through PLD 9) 
— (Minimum PLD setup time) 
= 30 ns — 818 — 3.5 ns — (2 ns ~ 7 os 
= 2 ns 


The above calculation assumes using a 7.5 ns PLD for 
PLD 9. By implementing the page hit detect logic in a 
5 ns programmable device, the available margin may be 
increased to 4.5 ns. 


During burst read cycles, the data transfer is alternated 
between the two banks. The margin available on the 
data path for each bank is as follows: 


Margin on Data Path Enable 

= 2 CLKs 

— (Max. delay to activate DENO/1 from PLD 10) 

— (Max. Data Transceiver, ’646A output enable 
time) 

— (Min. Pentium processor Data Setup Time) 

30 ns — 13.308 — 9.6 ns ~ 405 

2.7 ns 
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The margin available on the data path during burst 
read cycles is 2.7 ns and therefore requires the data 
transceiver enable function to be implemented in 5 ns 
programmable logic. 


9.0 SUMMARY 


This report presented performance trade-offs in memo- 
ry subsystem design for a Pentium processor-based sys- 
tem. It also discussed the design of an example memory 
subsystem without a second level cache for the Pentium 
processor. 


The following can be concluded from the material pre- 
sented in the memory performance section: 


¢ The Pentium processor bus interface is different 
than that for the Intel486 CPU. It is not dominated 
by write cycles (as was the case with the Intel486 
CPU), due to a write-back cache architecture for its 
data cache. The bus cycle mix and other simulation 
results showed that optimizations in read cycle tim- 
ing was more important than write cycle timing for 
the Pentium processor. 


© Among the read cycle timing parameters, the read 
burst timing showed more sensitivity to wait states 
than read lead off cycle timing. Approximately 3% —- 
5% loss in overall performance can result from ev- 
ery wait state added to read burst timing. 


© Optimization in write burst timing is not very crit- 
ical for Pentium processor desktop systems as it 
constitutes only a small percentage of overall bus 
activity. 


® Implementing DRAM subsystem designs to include 
two bank interleaving and DRAM paging will add 
5% —-10% to overall performance. This is important 
especially for systems that do not incorporate a sec- 
ond level cache. 


® Poor choices of bus speed, bus width and bus pipe- 
lining can degrade overall performance by 25%- 
35% in Pentium processor-based L2 cacheless sys- 
tems. 


The design example also demonstrated some key func- 


tions of Pentium processor memory subsystem design. 
These included writeback support for Pentium proces- 
sor in the DRAM subsystem; cycle pipelining for read 
and write cycles; two bank interleaving; paging, etc. 
The example also detailed the interface logic necessary 
to interface to a half speed, 32-bit bus. The simulated 
performance of this memory subsystem was found to 
range between 80% -—94% for different applications rel- 
ative to an ideal zero wait state system. 
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APPENDIX A | 
PERFORMANCE SIMULATION METHODOLOGY 


The simulation data presented in this report was ex- 
tracted from an Intel internal performance simulator. 
The simulator has the capability to simulate a Pentium 
processor, an L2 cache and a memory system. 


It also features the ability to vary memory subsystem 
parameters such as memory cycle timing, paging, pipe- 
lining, bus size, bus frequency and the ability to vary 
cache parameters for analyzing different hardware con- 
figurations. For the simulation results presented here, 
traces from five different applications running under 
DOS, UNIX and Windows were used. All the traces 
are random samples of instructions which were cap- 
tured while the respective applications were running. 
The DOS and UNIX traces are hardware generated. 
On the other hand, the Windows traces are software 
generated. 


In the hardware tracing scheme, the application is al- 
lowed to run in real time, while connected to a special 
bus monitoring hardware which stores all the bus activ- 
ity for some period of time. After the bus activity has 
been collected, it is then processed to infer the state of 
the CPU during the trace. The hardware tracing 
scheme does not have access to the internal state of the 
CPU while the application is running; however, it does 
execute in real-time and therefore includes operating 
system calls, I/O activity, timer interrupts, etc. 


The software tracing scheme, such as that used for 
Windows applications, were collected using the Micro- 
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soft Windows Kernel Debugger. This software debug- 
ger/tracer single steps through every instruction in an 
application and dumps some state information after ex- 
ecuting each instruction. The application being traced 
runs on a PC. This machine dumps the trace informa- 
tion to a serial port connected to another PC which 
captures and stores the trace. The advantage of the soft- 
ware tracing scheme is the ability to access the CPU 
state which cannot be inferred just by monitoring exter- 
nal bus activity. 


For information on acquiring a simulator and Pentium 
processor models to run simulations similar to those 
presented in this report, the following companies may 
be contacted. 


For the simulator, contact 


SES (Scientific and Engineering Software, Inc.) 
4301 Westbank Drive 

Austin, Texas 78746 

Tel: (512) 328-5544 

Fax: (512) 327-6646 


For Pentium processor models, contact 


C.A.E. Plus, Inc. 

9300 Jollyville Rd., #106 
Austin, Texas 78759 

Tel: (512) 338-0165 

Fax: (512) 338-0192 
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APPENDIX B 
PARTS LIST 


The following is a parts list for the example memory subsystem. Note only the major components used in the 
DRAM subsystem and bus conversion logic are shown. 


Component | 


Clock Driver 
66-MHz Clock Driver—Triquint GA1086 


DRAM Address Latches 

74FCT573 (Octal Transparent Latch) 

DRAM Address Multiplexors ) 

74FCT257 (Quad 2 to 1 Multiplexors) 

DRAM Data Transceivers | 

74FCT646A (Octal Transceiver/Register) 

Driver Buffers Fj 
74F244 (Octal Buffers) 

DRAM Modules 4-8 
256K x 36 or 512K x 36 (72-pin SIMM) 

Address Decode SRAM 

MT5C6408-10 (8K x 8 SRAM) 

CPU to EBC Host Bus Address Buffers 

74FCT645 (Octal Transceivers) 

Byte Enable and Data Parity Conversion oe cee 


85C060 (16 Macrocell PLD) 


CPU to EBC Host Bus (32-bit) Data Bus Buffers 
82352 (EISA Bus Buffers) 


Page Address Comparison Logic 
74FCT521B (8-bit Identity Comparator) 
74F273 (Octal D-Flip Flop) 


DRAM Control/Bus Conversion Logic (including 
EISA/ISA memory cycle tracking and address decode) 
85C220/85C224/85C22V10 PLDs 
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APPENDIX C 
PLD CODES AND SCHEMATICS 


Attached are the PLD codes and schematics that cover can be found in the Jntel486™ Microprocessor-Based 
the example Pentium Processor Memory Subsystem 82350 EISA Design Guide, Order No. 296504-001. 
and Bus Conversion Logic. Note: the schematics cover 

only the DRAM subsystem and the bus conversion log- The following is a reference list of the PLDs used in the 
ic to interface to the 82350 EISA Chip Set. Details on memory subsystem and bus conversion logic. 

EISA bus and XBUS interface with the 82350 Chip Set 


Te ee ee es 
CPU Cycle Tracking Logic N85C22V10 

2 RAS Generator N85C22V10 

4 


en ee 

inode: Sens 

| 3 | CASGeneratorforBanko | (NBC RBVIO 
ae ea CAS Generator for Bank 1 
[5 | RASDecode/RAS Precharge Count ——=SSSSSS*d;SCUNSCRIO 
[6 | CASDecode Banko SSSCSCS~*wrSC*ié CA 
a aed CAS Decode Bank 1 
[8 | BurstReady Generation forRead/wrte ——=SSS«dY~CNBSGRVIO 
[9 | PageHitComparson and Row Decode *?SCUNRSCRQO 


[13 | Adress Latch Enabe for Oycl Pipelining __——~—~SC*dtCN CVI 
[14 | Burst asi Generation Side 


13 a 
14 
| 20 | BankSelectduring EISABurstCycles | NS5C224 
4 Write Enable Generation PLD 
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module PLD1 


title § ‘DRAM CONTROLLER .- PLD 1 
INTEL CORPORATION, August 1992’ 


CPU Cycle Tracking Logic - This PLD generates the appropriate signals for tracking 


? CPU cycles to DRAM. It also generates the KEN# signal for the CPU. 
PLD1 device 'P22V10C’'; 
x,¢,2 = oe er 
" Inputs 
CLK pin 2; "CPU input clock" 
ADSn pin 3; "CPU ADS# Output" 
DRAMCSn pin 4; "DRAM Chip Select" 
BCNT pin 5: “Burst Count Active with 4th BRDY for Burst cycles" 
NAn pin 6; “CPU Next Address Input" 
7 


WRn pin : “CPU Write/Read Indication" 
PCACHEn pin 9; "CPU CACHE# output" 

HITn pin 10; "DRAM Page Hit Indicator" 
MRBRDYn pin Ws “Burst Ready for Read Cycles" 
MWBRDYn _ pin 12; "Burst Ready for Write Cycles" 


RESET pin 13; “CPU Reset Signal" 
HLDA pin 17; “CPU Hold Acknowledge Signal" \ 
KENn pin 27; "Cache Enable Output from Decode Logic" 
* Outputs 
CIPn pin 21; "CPU DRAM Cycle in Progress" 
PIPECYCn pin de "Active during Pipelined Cycles” 
CYC4X pin 23 istype ‘buffer’; “CPU Four Transfer Cycle" 
CT pin 26 istype ‘buffer’; “Cycle Track Output for Pipelining" 
P5KENn pin 18 istype ‘buffer’; “KEN# input to CPU" 
Vari pin 20 istype ‘buffer’; "State Variable" 
Var2 pin 24 istype ‘buffer’; “State Variable" 
Var3 pin 19 istype ‘buffer’; “State Variable" 


" State Register Assignments 


sreg = {[Var1, Var2, Var3]; 
So a= [0, 0, 0); 
Si = [0, 0, 1]; 
S2 = oO. 7,18 
S3 = i ae un 
S4 = [1, 0.15 


state_diagram sreg 


state SO: 
if RESET then SO with CiPn := 1; PIPECYCn := 1; endwith else 
if !ADSn & IHLDA then S1 with CiPn := 0; PIPECYCn := 1; endwith 
else SO with CIPn := 1; PIPECYCn := 1; endwith; 


state S1: 
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if RESET # DRAMCSn # (!CT.FB & BCNT) then SO with CIPn := 1; 
PIPECYCn := 1; endwith else 
if (PCACHEn & !HITn & !WRn) # (PCACHEn & !HiTn & WRn) then S2 
with CIPn := 0; PIPECYCn := 1; endwith else 
if ((PCACHEn & !NAn) # (WRn & !PCACHEn) # (WRn & PCACHEn & HITn) then S3 
with CIPn := 0; PIPECYCn := 0; endwith 
else S1 with CIPn := 0; PIPECYCn := 1; endwith; 


state S2: 
if RESET # IMWBRDYn # ('MRBRDYn & ADSn) then SO with CIPn := 1; 
PIPECYCn := 1; endwith else 
if IMRBRDYn & !ADSn then S1 with CiPn := 0; PIPECYCn := 1; endwith else 
if INAn & MRBRDYn & !WRn then S4 with CIPn := 0; PIPECYCn := 0; endwith 
else S2 with CIPn := 0; PIPECYCn := 1; endwith; 


state S3: 
if RESET # (1CT.FB & BCNT) # (!CT.FB & |CYC4X.FB & IMWBRDYn) then SO 
with CIPn := 1; PIPECYCn := 1; endwith else 
if (CT.FB & BCNT) # (CT.FB & !CYC4X.FB & IMWBRDYn) then S1 
with CIPn := 0; PIPECYCn := 1; endwith 
else S3 with CIPn := 0; PIPECYCn := 0; endwith; 


state S4: 
if RESET # (IMRBRDYn & !CT.FB) then SO with CIPn := 1; 
PIPECYCn := 1; endwith else 
if (MRBRDYn & CT.FB) then S1 with CilPn := 0; PIPECYCn := 1; endwith 
else S4 with CIPn := 0; PIPECYCn := 0; endwith; 
state_diagram [CT] 


state [0]: if RESET then [0] else 
if (HLDA & !ADSn & !PIPECYCn) then [1] else (0}: 


state [1]: if RESET then [0] else 
if (1CIPn & PIPECYCn) 
# (IMWBRDYn & !CYC4X.FB & !HITn) then [0] else [1]; 
state_diagram [P5KENn] 


state [1]: if RESET then [1] else 
if ((HLDA & !CIPn & IDRAMCSn & IKENn & !WRn) then [0] else [1]; 


state [0]: if RESET then [1] else 
if CIPn # (KENn & !1DRAMCSn) then [1] else [0]; 


equations 
[Var1, CIPn, PIPECYCn, Var2, Var3, CT, PSKENn, CYC4X].clk = CLK; 
[Var1, Var2, Var3, CT, PSKENn, CYC4X].AR = RESET; 


CIPn.OE = IHLDA; 
CYC4X:= —— (IPCACHEn) # (CYC4X.FB & IPIPECYCn); 
end PLD1; 
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module PLD2 


title § 'DRAM CONTROLLER - PLD 2 
INTEL CORPORATION, August 1992" 


: RAS Generator PLD - This PLD generates the DRAM Row Address Strobe 
: Signal (RAS#) 
PLD2 device -—=»-'P22V10C:: 
x, 6,2Z = in tome & 
"Inputs 
CLK pin 2; "CPU input clock" 
CIPn pin 3; "DRAM Cycle in Progress" 
DRAMCSn pin 4; “DRAM chip select" 
CT pin 5: "CPU Cycle Track" 
HITn pin 6; “DRAM Page Hit Signal" 
9 


BCNT pin ’ “Burst Count active with 4th BRDY for Burst Cycles" 
MRBRDYn pin 10; “Burst Ready Input for Read Cycles" 


RESET pin bP "CPU Reset Signal" 
HLDA pin 12; "CPU Hold Acknowledge Signal" 
WRn pin 13; “Write/Read signal" 
PCHG pin 16; "RAS Precharge Count" 
Won pin 25: "Write Cycle Indication for Bank 0" 
Win pin 19; "Write Cycle Indication for Bank 1" 
MiOn pin Fas "Memory/IO signal" 

"Outputs 
RASn pin 20 istype ‘buffer’; “DRAM Row Address Strobe Signal" 
Var1 pin 26 istype ‘buffer’; “State Variable" 
Var2 pin 18 istype ‘buffer’; “State Variable" 
DMEMR pin 24; “DRAM read cycle indicator" 
DMEMW pin 23; "DRAM write cycle indicator’ 
WIPn pin 21; “Write Cycle in Progress" 


"State Register Assignments 


sreg. = [Var1, Var2]; \ 
SO = [0, 0]; 
S1 = [O, 1]; 
S2 a= (1, 1]; 
$3 = [1, 0]; 


state_diagram sreg 


state SO: 
if RESET then SO with RASn := 1; endwith else 
if !CiIPn & IPCHG & !IDRAMCSn then S1 with RASn := 0; endwith 
else SO with RASn := 1; endwith; 


state S1: 
if RESET # (!CIPn & IDRAMCSn & HITn & !CT & !HLDA) # (HLDA & CIPn) 
# (}HLDA & CT & !WIPn & HITn) then SO with RASn := 1; endwith else 
if !HLDA & CT & HITn & !CIPn & IDRAMCSn & !WRn & WIPn then S2 
with RASn := 0; endwith 
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else S1 with RASn := 0; endwith; 


state S2: 
if RESET # (BCNT & IMRBRDYn) then SO with RASn := 1; endwith 
else S2 with RASn := 0; endwith; 


equations 
[Var1, RASn, Var2].clk = CLK; 
[Var1, RASn, Var2].AR = RESET; 


DMEMR = (1CIPn & MiOn & !WRn & !HLDA & !IDRAMCSn) 
# (!CIPn & HLDA & !WRn); 
DMEMW = (1CIPn & MlOn & WRn & !HLDA & IDRAMCSn) 
# (!CIPn & HLDA & WRn); 
IWIPn = !'WoOn # |Win; 
end PLD2; 
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module PLD3 


tile § ‘DRAM CONTROLLER - PLD 3 
INTEL CORPORATION, August 1992" 


: CAS Generator PLD for Bank 0 - This PLD generates the master DRAM Column 
Address Strobe Signal for Bank 0. It also generates the Write Cycle Active indication for 
. Bank 0. 


PLD3 device ‘P22V10C’; 
ee = Panis, Dotsy wlont 
“Inputs 
CLK pin =~ “CPU input clock" 
CIPn pin 3; “DRAM Cycle in Progress" 
CYC4X pin 4; “CPU four transfer cycle" 
HITn pin 5: "DRAM Page Hit Signal for CPU cycles" 
BCNT pin 6; “Burst Count active with 4th BRDY for Read Cycles" 
MRBRDYn pin 7; “Burst Ready Input for Read Cycles" 
MWBRDYn _ pin 9; "Burst Ready Input for Write Cycles" 


RESET pin 10; "CPU Reset Signal” 

HLDA pin 17; "CPU Hold Acknowledge Signal" 
WRn pin 12; “Read/Write Cycle Indicator" 
DRAMCSn pin 13; "DRAM Chip Select" 

RASn pin 16; “Row address strobe signal" 


REFRESHn pin 17; 
BANKSEL pin 18; 


"Refresh sequence indicator" 
“Banksel indicates Bank 0 or Bank 1" 


CT pin eS i “CPU Cycle Track for pipelined cycles" 
ALE pin 26; "Address Latch Enable Signal” 
"Outputs 
ICASOn pin 23 istype ‘invert’; “CAS Signal for Bank 0" 
!won pin 20 istype ‘invert’; “Active for Write Cycles to Bank 0" 
Var1 pin 24 istype ‘buffer’; “State Variable" 
Var2 pin 25 istype ‘buffer’; “State Variable" 
Var3 pin 21 istype ‘buffer’; “State Variable" 


"State Register Assignments 


sreg. = [CASOn, WOn, Var1, Var2, Var3]; 

1) = [0, 0, 0, 0, Oj; "“CASOn = 1; WOn = 1; 0 
S1 = [0, 0, 0, 0, 1]; "“CASOn = 1; WOn= 1; 1 
$2 = [0, 0,0, 1,0}; “CASOn = 1;WOn=1; 2 
$3 = (1, 0, 0, 0, 0]; “CASOn = 0; WOn = 1; 0 
S4 = [1, 0, 0,0, 1]; "“CASOn =0; WOn = 1; 1 
S5 = [0, 0,0, 1, 1]; "“CASOn = 1;W0On=1; 3 
S6 = {1,0,0, 1,0]; "“CASOn =0;WOn=1; 2 
S7 = [0, 1,0, 0, 0]; "“CASOn = 1; WOn =0; 0 
S8 = [1, 1,0, 0,0]; "CASOn = 0; WOn =0; 0 
S9 = [1, 1,0, 0, 1]; "“CASOn =0;W0On=0; 1 
$10 = [0, 1,0, 0, 1]; "“CASOn = 1; WOn=0; 1 
S11 = [0, 1,0, 1,0}; "“CASOn =1;WOn=0; 2 
Si2. = [0, 0, 1,0, 0]; "“CASOn = 1; WOn=1; 4 

state_diagram sreg ; 
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state SO: 
if REFRESHn & !CIPn & IDRAMCSn & !WRn & !RASn & !ALE & 
(1BANKSEL # CYC4X) then S3 
else if REFRESHn & !CiPn & IDRAMCSn & !'WRn & RASn then S1 
else if REFRESHn & !CiPn & !DRAMCSn & WRn then S7 
else SO; 


state S1: 
if ((HLDA & HITn) # (BANKSEL & !CYC4X) # WRn # 
(BANKSEL & HLDA) then SO 
else if (IBANKSEL & !CT & !RASn) # (!HLDA & !RASn & ICT & CYC4X) then S2 
else if (IBANKSEL & CT & !RASn) # (}!HLDA & !RASn & CT & CYC4X) then S3 
else S1; . 


state S2: 
goto $12; 


state S3: 
if (CYC4X & IMRBRDYn & !HLDA) then SO 
else if IMRBRDYn & IBANKSEL & !HLDA then S5 
else if ((MRBRDYn & BANKSEL) # HLDA then S4 
else S3; 


state S4: 
if IMRBRDYn & !HLDA then S5 
else if HLDA then S6 
else S4; 


state S5: 
goto S6; 


state S6: 
if (BCNT & !CT) # HLDA then SO 
else if (BCNT & CT & !HLDA) then S1 
else S6; 


state S7: 
if (BANKSEL & !CYC4X) then SO 
else if !HLDA & !RASn & ICYC4X & IMWBRDYn & !BANKSEL then S9 
else if (IRASn & CYC4X & IMWBRDYn) # (!RASn & HLDA) then S10 
else S7; 


state S8: 
if HLDA # CIPn # DRAMCSn # !WRn # BANKSEL then SO 
else S11; 


state S9: 
goto S8; 


state S10: 
goto S9; 


state S11: 
if RASn then S7 
else if IMWBRDYn then S9 
else if CiIPn then SO 
else S11; 
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state S12: 
goto S3; 


equations 
[Var1, Var2, Var3, CASOn, WOn].clk = CLK; 


[Var1, Var2, Var3, CASOn, WOn].AR 


ul 
DD 
m 
“” 
m 
= 


end PLD3; 
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"DRAM Page Hit Signal for CPU cycles" 
“Burst Count active with 4th BRDY for Burst Cycles" 
"Burst Ready Input for Read Cycles" 


“Burst Ready Input for Write Cycles" 


"Banksel indicates Bank 0 or Bank 1 access" 
"CPU Cycle Track for Pipelined cycles" 


"CAS Signal for Bank 1" 

“Active for Write Cycle to Bank 1" 
"State Variable" 

"State Variable" 


AP-478 
module PLD4 
title ‘DRAM CONTROLLER - PLD 4 
INTEL CORPORATION, August 1992’ 
: CAS Generator PLD for Bank 1 - This PLD generates the master DRAM Column 
" Address Strobe Signal (CAS) for Bank 1. It also generates the Write Cycle Active 
. indication for Bank 1. 
PLD4 device '‘P22V10C'; 
Xx, C,zZ = Pasa dy, ales 
"Inputs 
CLK pin 2; "CPU input clock" 
CIPn pin 3; "DRAM Cycle in Progress" 
CYC4X pin 4; “CPU four transfer cycle" 
HITn pin 5; 
BCNT pin 6; 
MRBRDYn pin 7 
MWBRDYn pin 9; 
RESET pin 10; "CPU Reset Signal" 
HLDA pin aa "CPU Hold Acknowledge Signal” 
WRn pin 12; "Read/Write Cycle Indicator’ 
DRAMCSn pin 13; "DRAM Chip Select" 
RASn pin 16; “Row address strobe signal" 
REFRESHn pin 17; “Refresh sequence indicator" 
BANKSEL pin 18; 
CT pin Pt i 
ALE pin 26; “Address Latch Enable Signal" 
"Outputs 
ICAS1n pin 23 istype ‘invert’; 
Win pin 25 istype ‘invert’; 
Var1 pin 24 istype ‘buffer’; 
Var2 pin 20 istype ‘buffer’; 
Var3 pin 21 istype ‘buffer’; 


S10 
S11 
$12 


” 
o 
ace oe oe on ee ee 


state_diagram sreg 
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State Register Assignments 


"State Variable" 


[CAS1n, Win, Var, Var2, Var3); 


[0, 0, 0, 0, 0]; 
[0, 0, O, 0, 1]; 
[0, 0, O, 1, 0]; 
it, & ¢, 0, of: 
71, 0,0; 0, 1h 
[o, 6, 0, 1, 1; 
1,2, 0, 1, Gt 
[0, 1, 0, 0, 0}; 
[t, 1,-0, 0, i 
it, 4, @, 0, 1k 
(0,1, 0; 0,.1}; 
[0, 1, 0, 1, 0]; 
(0, 0, 1, 0, OF 


"CASin = 1; Win=1; 
"CAS1n = 1; Win =1; 
"CASin = 1; Win = 1; 
"CAS 1n = 0; Win = 1; 
"CAS1n = 0; Win = 1; 
"CASin = 1; Win = 1; 
"CAS1n = 0; W1n = 1; 
"CASin = 1; Win =0; 
"CASin = 0; Win = 0; 
"CASin = 0; W1n = 0; 
"CASin = 1; Win =0; 
"CAS1n = 1; Win =0; 
"CASin = 1; Win =1; 


hMyO-"-$"OONW-"ON—-"O 
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state SO: 
if REFRESHn & !CIPn & IDRAMCSn & !WRn & !RASn & !ALE & 
(BANKSEL # CYC4X) then S3 else 
if REFRESHn & !CiPn & WRn & IDRAMCSn then S7 else 
if REFRESHn & !CiPn & IDRAMCSn & !WRn & RASn then S1 
else SO; 


state S1: 
if (HLDA & HITn) # (!BANKSEL & !ICYC4X) # WRn # 
('BANKSEL & HLDA) then SO else 
if (BANKSEL & !RASn & !CT) # (!HLDA & !RASn & CYC4X & !CT) then S2 else 
if (BANKSEL & !RASn & CT) # (!HLDA & !RASn & CYC4X & CT) then S3 
else S1; 


state S2: 
goto S12; 


state S3: 
if (ICYC4X & IMRBRDYn & !HLDA) then SO else 
if IMRBRDYn & BANKSEL & !HLDA then S5 else 
if (MRBRDYn & !BANKSEL) # HLDA then S4 
else S3; 


state S4: 
if IMRBRDYn & !HLDA then S5 else 
if HLDA then S6 
else S4; 


state S5: 
goto S6; 


state S6: 
if (BCNT & !CT) # HLDA then SO else 
if (BCNT & CT) then S1 
else S6; 


state S7: 
if ((IBANKSEL & !CYC4X) then SO else 
if IHLDA & !RASn & ICYC4X & IMWBRDYn & BANKSEL then S9 else 
if ((RASn & CYC4X & IMWBRDYn) # (!RASn & HLDA) then S10 
else S7; 


state S8 : / 
if HLDA # CIPn # !WRn # DRAMCSn 
# (IBANKSEL & !CYC4X) then SO 
else S11; 


state S9: 
goto S8; 


state S10: 
if IMWBRDYn # HLDA then S9 
else S10; 


state S11: 
if RASn then S7 
else if IMWBRDYn then S9 
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else S11; 


state S12: 
goto S3; 


equations 


[Var1, Var2, Var3, CAS1n, Win].clk 
[Var1, Var2, Var3, CAS1n, W1n].AR 


end PLD4; 
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CLK; 
RESET; 
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module PLD5 


title ‘DRAM CONTROLLER - PLD 5 
INTEL CORPORATION, August 1992' 


: RAS Decode and RAS Precharge Count PLD - This PLD decodes RAS signals 
. for the DRAM array. It also generates the RAS precharge count signal for miss cycles. 
PLD5 device 'E0320'; 
ee | = Pray: Metey lle 
"Inputs 
CLK pin 5 "CPU input clock" 
RESET pin 4; “CPU Reset Signal" 
ROWAn pin 5 "Row A" 
ROWBn pin 6; "Row B" 
HIMEMn pin 7; "Upper Half of 512K indicator" 
RASn pin 8; “Row Address Strobe Signal” 
"Outputs 
PCHG pin 12; “RAS Precharge Count" 
RASAOn pin 13; “Row Address Strobe for Lower Half of 512K in Row A" 
RASA1n pin 14; "Row Address Strobe for Upper Half of 512K in Row A" 
RASBOn pin 15; “Row Address Strobe for Lower Half of 512K in Row B" 
RASBin pin 16; “Row Address Strobe for Upper Half of 512K in Row B" 
Vari pin 17; “State Variable" 
Var2 pin 18; "State Variable" 


“State Register Assignments 


sreg = [Var1, Var2]; 
so = [0, 0]; 
S1 = [0, 1]; 
S2 = [1, 1]; 
S3 = [1, 0]; 


state_diagram sreg 


state SO: 
if RESET then SO with PCHG := 0; endwith else 
if !RASn then S1 with PCHG := 1; endwith 
else SO with PCHG := 0; endwith; 


state S1: 
if RESET then SO with PCHG := 0; endwith else 
if RASn then S2 with PCHG := 1; endwith 
else S1 with PCHG := 1; endwith; 


state S2: 
goto SO with PCHG := 0; endwith; 


state S3: 
goto SO with PCHG := 0; endwith; 


equations 
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[Vari, PCHG, Var2].clk = CLK; 


AP-478 
!RASAOn 
IRASA1n 
{RASBOn 
IRASB1n 
end PLDS5; 
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![RASn & HIMEMn & !ROWAn; 
IRASn & HIMEMn & !ROWAn; 
[RASn & HIMEMn & !ROWBn; 
IRASn & HIMEMn & !ROWBn; 


241562-40 


PRELIMINARY 


intel i : AP-478 


module PLD6 


tile § ‘DRAM CONTROLLER - PLD 6 
INTEL CORPORATION, August 1992' 


; CAS Decode PLD for Bank 0" 
PLD6 device 'E224C'; 
oy = ey deg he 
"Inputs 
CASOn pin af; “Column Address Strobe for Bank 0" 
DMEMR pin > “DRAM Read Cycle" 
DMEMW pin 3; “DRAM Write Cycle" 
LBEOn pin 4; “Latched Byte Enables BEO# - BE7#" 
LBE1n | pin 5: 
LBE2n pin 6; 
LBE3n pin 7; 
LBE4n pin 9; 
LBE5n pin 10; 
LBE6n pin 11; 
LBE7n pin 12; 
" Outputs 
CASO0n pin 18; "CAS Decode outputs for Bank 0" 
CASOI1n pin 19; . 
CASO2n pin 20; : 
CASO3n pin 21; a 
CAS04n pin 23; : 
CASO5n pin 24; , 
CAS06n pin 25; " 
CASO7n pin 26; g 
equations 
ICASOOn = = (ICASOn & DMEMR) # (!CASOn & DMEMW & !LBEOn); 
ICASOIn (1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE1n); 
ICASO2n (!1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE2n); 
ICASO3n (!1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE3n); 
ICASO4n (1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE4n); 
ICASO5n (1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE5n); 
ICASO6n (!1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE6n); 
!CASO7n (!1CASOn & DMEMR) # (!CASOn & DMEMW & !LBE7n); 
end PLD6; 
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module PLD7 


title 


‘DRAM CONTROLLER - PLD 7 


INTEL CORPORATION, August 1992’ 


CAS Decode PLD for Bank 1" 


a, ©, 2 


Inputs 


CAS1n 
DMEMR 
DMEMW 
LBEOn 
LBE1n 
LBE2n 
LBE3n 
LBE4n 
LBE5n 
LBE6n 
LBE7n 


Outputs 


CAS10n 
CAS11n 
CAS12n 
CAS13n 
CAS14n 
CAS15n 
CAS16n 
CAS17n 


equations 


ICAS10n 
ICAS11n 
ICAS12n 
!1CAS13n 
ICAS14n 
ICAS15n 
ICAS16n 


ICAS17n 


end PLD7; 
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device 


'E224C'; . 
ag hadi: stan 
pin 27: “Column Address Strobe for Bank 1" 
pin 2 "DRAM Read Cycle" 
pin 3; "DRAM Write Cycle" 
pin 4; “Latched Byte Enables BEO# - BE7#" 
pin 5; 
pin 6; 
pin rj 
pin 9; 
pin 10; 
pin 1; 
pin 12; 
pin 18; "CAS Decode outputs for Bank 1" 
pin 19; " 
pin 20; " 
pin rl 
pin 23; * 
pin 24; " 
pin 25; * 
pin 26; " 


(1CAS1n & DMEMR) # (!CAS1n & DMEMW & !LBEOn); 


(‘CASin & DMEMR) # (!'CAS1n & DMEMW & !LBE1n); 


(!CASin & DMEMR) # (!CASin & DMEMW & !LBE2n); 


(}CAS1n & DMEMR) # (!CASin & DMEMW & !ILBE3n); 


(‘}CAS1n & DMEMR) # (!CASin & DMEMW & !LBE4n); 


(‘1CASin & DMEMR) # (!CASin & DMEMW & !LBE5n); 


(‘CAS1n & DMEMR) # (!CASin & DMEMW & !LBE6n); 


(‘CAS1n & DMEMR) # (!CAS1n & DMEMW & !LBE7n); 
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module PLD8 


title 


X,C,Z 


‘DRAM CONTROLLER - PLD 8 


INTEL CORPORATION, August 1992’ 


Generates Burst Ready Signals for CPU Read and Write Cycles" 


device 


Inputs 


CLK 
CIPn 
DRAMCSn 
WRhn 
CASOn 
CAS1n 
BCNT 
HITn 
CYC4X 
HLDA 
RESET 
RASn 
CT 
ALE 


" Outputs 


MRBRDYn 
MWBRDYn 
Var1 
Var2 
Var3 
Var4 
Var5 
Var6 


'E224C': 


Pony Hoty, SO 
pin 2; 
pin 16; 
pin 3; 
pin 4; 
pin 5; 
pin 6; 
pin 7; 
pin 17; 
pin 9; 
pin 10; 
pin 11; 
pin 12; 
pin 13; 
pin ef; 
pin 18; 
pin 19; 
pin 20 
pin 21 
pin 23 
pin 24 
pin 25 
pin 26 


" State Register Assignments 


PRELIMINARY 


sreg1 
SO 
S1 
S2 
S3 


w” 
ovens 
wal, 
unk unt tb uu 


"CPU input clock" 

"DRAM Cycle in Progress" 
“DRAM Chip Select" 
“Write/Read Indicator" 

"CAS# for Bank 0" 

"CAS# for Bank 1" 

“Burst Count Active with 4th BRDY for Burst Cycles" 
"DRAM Page Hit Signal” 

"CPU four transfer cycle" 

"CPU Hold Acknowledge Signal" 
“CPU Reset Signal" 

“RAS# Signal" 

"CPU Cycle Track Signal" 
“Address Latch Enable Signal" 


“Burst Ready for CPU Read Cycles" 
"Burst Ready for CPU Write Cycles" 
istype ‘buffer’; "State Variable" 
istype ‘buffer’; “State Variable" 
istype ‘buffer’; “State Variable" 
istype ‘buffer’; “State Variable" 
istype ‘buffer’; “State Variable“ 
istype ‘buffer’; “State Variable" 


[Var1, Var2, Var3]; 


[0, 0, 0); 
[O, 1, 0); 
1, 1, 01; 
(0; 7, 1]; 
ity ta 4a 


[Var4, Var5, Var6}; 


[0, 0, 0); 
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state_diagram sreg1 


state SO: 
if RESET then SO with MRBRDYn := 1; endwith else 
if ((CIPn & IDRAMCSn & !WRn & !CASOn & !HLDA & !ALE) 
# (ICIPn & IDRAMCSn & !WRn & !CAS1n & !HLDA & !ALE) then S1 
with MRBRDYn := 1; endwith 
else SO with MRBRDYn := 1; endwith; 


state S1: 
if RESET then SO with MRBRDYn := 1; endwith else 
if |CYC4X then S2 with MRBRDYn := 0; endwith-else 
if CYC4X then S3 with MRBRDYn := 0; endwith 
else S1 with MRBRDYn := 1; endwith; 


state S2: 
goto SO with MRBRDYn := 1; endwith; 


state S3: | 
if RESET # BCNT then SO with MRBRDYn := 1; endwith else 
if |BCNT then S4 with MRBRDYn := 1; endwith 

else S3 with MRBRDYn := 0; endwith; 


state S4: 
if RESET # BCNT then SO with MRBRDYn := 1; endwith else 
if |BCNT then S3 with MRBRDYn := 0; endwith 
else S4 with MRBRDYn := 1; endwith; 


state_diagram sreg2 


state S8: 

if RESET then S8 with MWBRDYn := 1; endwith else 

if (1CIPn & IDRAMCSn & WRn & IHLDA & CYC4X & !ALE) then S12 
with MWBRDYn := 1; endwith else 

if (1CIPn & !IDRAMCSn & WRn & !HLDA & HITn & !ALE). 
# (!CIPn & IDRAMCSn & WRn & IHLDA & RASn & !ALE) then S9 
with MWBRDYn := 1; endwith else 

if (CIPn & IDRAMCSn & WRn & !HITn & !RASn & !HLDA & !ALE) then S10 
with MWBRDYn := 0; endwith 

else S8 with MWBRDYpn := 1; endwith; 


state S9: 
if RESET then S8 with MWBRDYn := 1; endwith else 
if !RASn then S10 with MWBRDYn := 0; endwith 
else S9 with MWBRDYn := 1; endwith; 


state S10: 
if RESET # (!CT & !HITn) then S8 with MWBRDYn := 1; endwith else 
if CT & 'H!Tn then S11 with MWBRDYn := 1; endwith else 
if HITn then S9 with MWBRDYpn := 1; endwith 
else S10 with MWBRDYn := 0; endwith; 


state S11: 
if RESET then S8 with MWBRDYn := 1; endwith 
else S9 with MWBRDYn := 1; endwith; 


state S12: 
if RESET then S8 with MWBRDYn := 1; endwith 
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else S13 with MWBRDYn := 0; endwith; 
state S13: 
if RESET # BCNT then S8 with MWBRDYn := 1; endwith else 
if |BCNT then S14 with MWBRDYn := 1; endwith; 
state S14: 
if RESET # BCNT then S8 with MWBRDYn := 1; endwith 
else if IBCNT then S13 with MWBRDYn := 0; endwith; 


state S15: 
goto S8 with MWBRDYn := 1; endwith; 


equations 
[Var1, MRBRDYn, MWBRDYp, Var2, Var3, Var4, Var5, Var6}.clk = CLK; 


end PLD8; 
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module PLD9 


title ‘DRAM CONTROLLER - PLD 9 
INTEL CORPORATION, August 1992’ 


DRAM Page Hit comparison and Row Decode PLD" 


PLD9 device ‘E0320’; 
G2 2 Pees Teakig ted 
"Inputs 
A21 pin ¥ "CPU Address Line A21" 
A22 pin 2; "CPU Address Line A22" 
A23 pin 3; "CPU Address Line A23" 
PA21 pin 4; “Previous Address Line A21" 
PA22 pin 5; "Previous Address Line A22" 
PA23 pin 6; “Previous Address Line A23" 
PROWAn pin 7; “Previous DRAM ROW A decode" 
PROWBn pin 8; "Previous DRAM ROW B decode" 
PHIMEMn pin 9; “Previous DRAM High/Low Memory Bank Decode" 
MSIZA pin 11; "MSIZ = 0 for 256Kx36 module, MSIZ = 1 for 512Kx36 module" 
MSIZB pin 12; "MSIZA for Row A, MSIZB for Row B" 
HIT1n pin 13; “Address Comparison of A13-A20 with PA13-PA20" 
“ Qutputs 
HIT2n pin 14; “Active if A21, 22, 23 match previous PA21, 22, 23" 
HIT3n pin 15; “Active if previous Row matches current Row and Bank" 
HITn pin 16; “DRAM Page Hit Signal" 
HIMEMn pin 17; "Indicates access to upper half of 512K byte" 
ROWAn pin 18; "DRAM Row A decode" 
ROWBn pin 19; "DRAM Row B decode" 
equations 
IROWAnN == = (!A23 & !A22) # (MSIZA & !A23 & A22); 
IROWBn = (!IMSIZA & !A23 & A22) # (MSIZA & A23 & !A22) # (MSIZB & A23 & A22); 


!HIMEMn = (MSIZA & !A23 & A22) # (MSIZB & A23 & A22); 


HIT2n = (!A21 & PA21) #(A21 & IPA21) # (IA22 & PA22) # (A22 & IPA22) 
# (IA23 & PA23) # (A23 & !PA23); 
HIT3n = {((1A23 & A22 & IMSIZA & IMSIZB & PHIMEMn & PROWAn & !PROWBn) 
# (!A23 & A22 & MSIZA & !MSIZB & !PHIMEMn & !PROWAn & PROWBn)):; 
HITn = (HITin) # (HIT2n) # (HIT3n); 
end PLD9; 
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module 


title 


A, &, & 


"Inputs 


CLK 
CIPn 


PLD10 


‘DRAM CONTROLLER - PLD 10 
INTEL CORPORATION, August 1992' 


Data Transceiver Enable for Bank 0 and Bank 1" 


DRAMCSn 
BANKSEL 
MRBRDYn 


WRn 
CYC4X 
HLDA 
RESET 
RASn 
CT 


" Outputs 


DENOA 
DEN1A 
Var1 
Var2 
DENIC 
DENOB 
DEN1B 
DEN 


device 'E224C': 

= Dixy. wholieg alles 

pin 2; "CPU input clock" 

pin 12; “DRAM Cycle in Progress" 

pin 3; "DRAM Chip Select" 

pin 4; "BANKSEL indicates Bank 0 or Bank 1 access" 

pin 5; “BRDY for Read cycles" 

pin 6; "Write/Read Indicator" 

pin 7; "CPU four transfer cycle" 

pin 10; “CPU Hold Acknowledge signal" 

pin 9; “CPU Reset Signal" 

pin 13; "DRAM RAS# signal" 

pin 16; “CPU Cycle Track signal" 

pin 18; "Data XCVR Enable Bank 0 (single rd/wr, burst rd/wr bank 0)" 
pin 26; "Data XCVR Enable Bank 1 (burst read bank 1, BANKSEL = 0)" 
pin 19; “State Variable" 

pin 20; "State Variable" 

pin at; “Data XCVR Enable Bank 1 (burst writes)" 

pin 23; "Data XCVR Enable Bank 0 (burst read bank 0, BANKSEL = 1)" 
pin 24; "Data XCVR Enable Bank 1 (single rd/wr, burst read bank 1)" 
pin 25; 


" State Register Assignments 


state_diagram sreg 


sreg = [Var1, Var2, DEN]; 

SO = 0:0, oF 

S1 = - 10,0. 1); 

S2 = 0, 1, 1); 

S3 = {0, 1,0); 

S4 = 71,4. 

S5 = fr, 1, 0k 

DRAMCYCn = = CIPn #DRAMCSn # RASn; 


state SO: 


state S1 


if RESET then SO else 
if ((DRAMCYCn & ICT & ICYC4X & !HLDA) 
# (IDRAMCSn & !CiPn & ICT & CYC4X & IHLDA & WRn) 
# (IDRAMCYCn & HLDA) then S1 else 
if (DRAMCYCn & ICT & CYC4X & !IHLDA & !WRn) then S2 
else SO; 


if RESET # CIPn # (IMRBRDYn & !HLDA) then SO 


: PRELIMINARY 
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else S1; 
state S2: if RESET then SO else 
if IMRBRDYn then S3 
else S2: 
state S3: if RESET then SO else 
if IMRBRDYn then S4 
else S3; 
state S4: if RESET then SO else 
if IMRBRDYn then S5 
else S4; 
state S5: if RESET # IMRBRDYn then SO 
else S5; 
equations 
{[Var1, Var2, DEN].clk = CLK; 
DENOA = !IDEN.FB; 
DEN1A = IDENOA; 
DENOB = IDEN1B; 
DEN1B = IDEN.FB; 
DENIC = 1DEN.FB; 
DENOA.OE = IBANKSEL; 
DEN1A.OE = CYC4X & IHLDA & IBANKSEL & DENOA & 'WRn & IDRAMCYCn; 
DENOB.OE = CYC4X & IHLDA & BANKSEL & DEN1B & !WRn & IDRAMCYCn; 
DEN1B.0E = BANKSEL; 
DEN1C.OE = CYC4X & IHLDA & WRn & IBANKSEL; 
end PLD10; 
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module PLD11 


title ‘DRAM CONTROLLER - PLD 11 
INTEL CORPORATION, August 1992’ 


Activates Write Data Register for Bank 0 and Bank 1" 


PLD11 device 'E0320'; 
x, 6, zZ = Fret Ney sok 
“Inputs 
CLK pin 1; “CPU input clock 
CiPn pin a "DRAM Cycle in Progress 
PIPECYCn pin 3; "Active during pipelined cycles 
BANKSEL pin 4; "Indicates Bank 0 or Bank 1 access 
MWBRDYn pin 8 "BRDY for Write Cycles 
CYC4X pin 6; "CPU four transfer cycle 
HLDA pin le "CPU Hold Acknowledge output 
RESET pin 8; "CPU Reset Input 
CASOn pin 9; "CAS# for Bank 0 
CASin pin 11; “CAS# for Bank 1 
DMEMW pin Te, "DRAM write cycle 
LAS pin 19; “Latched Address line 3 signal 
"Outputs 
DCPO pin 13; “Activates Write Data Register for Bank 0" 
DCP1 pin 14; “Activates Write Data Register for Bank 1" 
Var1 pin 15; "State Variable" 
Var2 pin 16; "State Variable" 
Var3 pin 17 "State Variable" 
Var4 pin 18; "State Variable" 


" State Register Assignments 


sregi = [DCPO, Var1, Var2]; 
S00 = [0, 0, 0}; 
S01 = [1, 0, O]; 
S02 = ii, 0, if 
S03. = [0, 0, 1]; 
S04 = (1, 1, 1); 
S05 = 1.4, 15 
sreg2 = [DCP1, Var3, Var4]; 
S10 = [0, 0, 0); 
S11 = t1,.0, OF 
S$i2 = [0, 1, 0]; 
$13 = [1, 1, 0}; 
S14. = [O, 1, 1]; 
515 .= [0, 0, 1]; 
Si6 = ie TE 


state_diagram sreg1 
state SOO: if RESET then S00 else 


if (1CIPn & PIPECYCn & ILA3 & DMEMW & !CYC4X 
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& IHLDA) # (!CIPn & !LA3 & DMEMW 


intel. 
& HLDA) then S01 else 


if (1CIPn & IBANKSEL & DMEMW & CYC4X & !HLDA & PIPECYCn) 


then S02 
else SOO; 
state S01: if RESET # !CASOn then SOO 
else S01; 
state S02 : if RESET then SOO else 
if (CASOn & CYC4X) then SO3 else 
if ((CASOn & !CYC4X) then SOO 
else S02; 
state S03: if RESET then SOO 
else S04; 
state S04 : if RESET # !CASOn then S05 
else S04; 
state S05 : goto S00; 
state_diagram sreg2 
state S10: if RESET then S10 else 
if (CIPn & PIPECYCn & LA3 & DMEMW & !CYC4X 
& 'HLDA) # (!CIPn & LAS & DMEMW 
& HLDA) then S11 else 
if (CIPn & IMWBRDYn & DMEMW & CYC4X & !HLDA) then S12 
else S10; 
state S11: if RESET # !CASin then S10 
else S11; 
state S12: if RESET # ICYC4X then S10 
else $13; 
state S13: if RESET then S10 
else if !CASin then S14 
else S13; 
state S14: if RESET then S10 
else S15; 
state S15: if RESET then S10 
else S16; 
state S16: if RESET # CAS1n then S10 
else S16; 
equations 
[Var1, Var2, Var3, Var4, DCPO, DCP1].clk = CLK; 
end PLD11; 
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® : 
module PLD12 


title §'DRAM CONTROLLER - PLD 12 
INTEL CORPORATION, August 1992' 


This PLD generates Bank Select during CPU Burst Cycles," 
Burst Count during Burst Read and Writeback Cycles," 
and Row Address Latch Enable for Page Comparison" 


PLD12 device 'E0320'; 

46.2 wma eh, wee 

"Inputs 
CLK pin y "CPU Clock Input 
CIPn pin 2; "DRAM Cycle in Progress 
DRAMCSn pin 3; "DRAM Chip Select 
LAS pin 4; “Latched Address line A3 
MRBRDYn pin 5; "BRDY# for Read Cycles 
MWBRDYn pin 6; "BRDY# for Write Cycles 
CYC4X pin Tj "CPU four transfer cycle 
HLDA pin 8; “CPU HOLD Acknowledge Output 
RESET pin 9; “CPU RESET signal 
HITn pin 1d; "DRAM Page Hit Signal 

" Qutputs 
BANKSEL pin 13; “Bank Select during CPU burst cycles" 
Var1 pin 14; "State Variable" 
BCNT pin 15: “Burst Count active with 4th BRDY" 
Var2 pin 16; "State Variable" 
Var3 pin 17: "State Variable" 
Var4 pin 18; "State Variable” 
RALE pin 19; “Row Address Latch Enable“ 
BCNT3 pin 12; "active with 3rd BRDY" 


"State Register Assignments 


sregl = [BANKSEL, Var1]; 

S00 = [0, 0}; 

$0i = [1, 0]; 

S02 = [O, 1]; 

S03 = (1, 1]; 

sreg2 = [Var2, Var3, Var4, BCNT]; 

$10 = [0, 0, O, 0}; 

S11 = [0, 0, 1, 0}; 

a iD, 1, 1. OF 

$13: = [0, 1, 0, 0); 

Ssi¢ «= a, 1, 0 GF 

S15 = vst, 15 9h 
state_diagram sreg1 

state SOO: 

if RESET then SOO else 


if (1CIPn & IDRAMCSn & LA3 & !HLDA) then S01 else 
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if (ICIPn & IDRAMCSn & !LA3 & !HLDA) then S02 
else SOO; 


state S01: 
if RESET then S00 else 
if (CYC4X & BCNT.FB & IMRBRDYn) # (CYC4X & BCNT.FB & IMWBRDYn) 
# (ICYC4X & IMRBRDYn) # (ICYC4X & IMWBRDYn) then S03 
else S01; 


state S02: 
if RESET # (CYC4X & BCNT.FB & IMRBRDYn) # 
(CYC4X & BCNT.FB & IMWBRDYn) # (ICYC4X & IMRBRDYn) # 
(1CYC4X & IMWBRDYn) then SOO 
else S02; 


state S03 : 
if RESET then SOO else 
if (1CIPn & IDRAMCSn & LA3 & !HLDA) then S01 
else SOO; 


state_diagram sreg2 


state S10: 
if RESET then S10 with BCNTS := 0; endwith else 
if (CIPn & IDRAMCSn & CYC4X & !HLDA) then S11 with BCNTS := 0; endwith 
else S10 with BCNTS := 0; endwith; 


state S11: 
if RESET # ICYC4X then S10 with BCNTS := 0; endwith else 
if IMRBRDYn # !MWBRDYn then S12 with BCNTS := 0; endwith 
else S11 with BCNTS := 0; endwith; 


state S12: 
if RESET then S10 with BCNT3 := 0; endwith else 
if {'MRBRDYn # IMWBRDYn then S13 with BCNTS3 := 0; endwith 
else $12 with BCNTS := 0; endwith; 


state S13: 
if RESET then S10 with BCNTS := 0; endwith else 
if IMRBRDYn # IMWBRDYn then S14 with BCNTS := 1; endwith 
else S13 with BCNTS := 0; endwith; 


state S14: 
if RESET then S10 with BCNTS := 0; endwith 
else S15 with BCNTS := 0; endwith; 


state S15: 
goto S10 with BCNTS := 0; endwith; 


state_diagram [RALE] 


state [0] : 
if RESET then [0] else 
if !ClIPn & !IDRAMCSn & HITn then [1] 
else [0]; 


state [1]: 
if RESET # !HITn then [0] 
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else [1]; 
equations 
[Var1, BCNTS, Var2, Var3, Var4, BANKSEL, BCNT, RALE].clk = CLK; 
BANKSEL.OE = _ !HLDA; 


end PLD12; 
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module PLD13 


title 


‘DRAM CONTROLLER - PLD 13 


INTEL CORPORATION, August 1992’ 


PLD13 device 


X,c,Z = 


Inputs 


CLK 

CIPn 
DRAMCSn 
ADSn 
MRBRDYn 
MWBRDYn 
CYC4X 
HLDA 
RESET 
WRn 

NAn 
BCNT 
MSBURSTn 
DBCLK 
CMDn 
BCNT3 


Outputs 


IALE 
Var1 
Var2 


This PLD Generates the address latch enable for cycle pipelining” — 


'P22V10C'; 
a tees te 
pin Co "CPU Input Clock 
pin 27; “DRAM Cycle in Progress 
pin 3; "DRAM Chip Select 
pin 4; “CPU Address Status Output 
pin 3: "BRDY# for Read Cycles 
pin 6; "BRDY# for Write Cycles 
pin 7; "CPU Four Transfer Cycle 
pin yb "CPU Hold Acknowledge Signal 
pin 9; "CPU Reset Input 
pin 10; “Write/Read Indicator 
pin 11; "CPU Next Address Input 
pin 13; “Burst Count Active with 4th BRDY for Burst Cycles 
pin 26; ~ “EISA Burst Cycle Indicator 
pin 25; "Delayed BCLK 
pin 16; “EISA Cycle Command Indicator 
pin 23; “Burst Count Active with 3rd BRDY for Burst Cycles 
pin 20 istype ‘invert’; 
pin 18 istype ‘buffer’; “State Variable" 
pin 19 istype ‘buffer’; "State Variable" 


"State Register Assignments 


sregi = [ALE, Var1, Var2]; 
So — [0, 0, 0}; "ALE = 1; 
S1 = [1, 0, O]; "ALE = 0; 
$2 = 1, 0, 1]; "ALE = 0; 
S3 = i, 7, Ti "ALE = 1; 
S4 = iy, t. 0 "ALE = 0; 
$5 = [O, 1, 0]; "ALE = 1; 
state_diagram sreg1 
state SO: 
if RESET then SO else 
if (1CIPn & IDRAMCSn) # (!ADSn & !HLDA) then S1 
else SO; 
state S1: 


if RESET # CIPn # DRAMCSn # (CYC4X & BCNT & WRn & !HLDA) 
# (MSBURSTn & CIPn & HLDA) then SO else 

if INAn & !HLDA then S2 else 

if !IDBCLK & IMSBURSTn & HLDA & !CMDn then S3 
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else S1; 


state S2: 
if RESET # (ICYC4X & IMRBRDYn & ADSn) # CIPn then SO else 
if 1CYC4X & IMRBRDYn & !ADSn) then S3 else 
if IMWBRDYn then S4 else 
if (BCNT3 & !WRn & CYC4X) then S5 
else S2; 


state S3: 
if RESET # CiPn # DRAMCSn # (MSBURSTn & DBCLK) then SO else 
if (CIPn & IDRAMCSn) # (ODBCLK & !MSBURSTn & HLDA & !CMDn) then S1 
else S3; 


state S4: 
goto SO; 


state S5: 


if RESET then SO 
else S3; 


[ALE, Vari, Var2].clk = CLK; 
[ALE, Var1, Var2].AR =RESET; 


end PLD13; 
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module PLD14 


title 


‘DRAM CONTROLLER - PLD 14 


INTEL CORPORATION, August 1992’ 


device 


xX, ©,.z ss 


Inputs 


CLK 

CIPn 
DRAMCSn 
LA4 

LA13 
MRBRDYn 
MWBRDYn 
CYC4X 
HLDA 
RESET 
RASn 
BCNT 
ALE 


Outputs 


BOAMAO 


2) 
G 
ae ee ie ee 


state_diagram sreg 


state SO: 


Burst Address Generation PLD - This PLD generates the Burst Address and supplies the 
lowest address line to the DRAM array. 


'E224C'; 


Re ARB te 
pin 2; 
pin af, 
pin 3; 
pin 4; 
pin 5; 
pin. 7; 
pin 12; 
pin 9; 
pin 10; 
pin » i I 
pin 13; 
pin 16; 
pin 7; 
pin 24; 
pin 25; 
pin 26; 
pin 18; 
pin 19; 
pin 20 
pin 21 
‘pin 23 


State Register Assignments 


"CPU Input Clock 

“DRAM Cycle in Progress 
“DRAM Chip Select 

“Latched Address line A4 
“Latched Address line A13 
"BRDY# for Read Cycles 
“BRDY# for Write Cycles 

“CPU Four Transfer Cycle 

“CPU Hold Acknowledge Output 
“CPU Reset input 

"DRAM RAS# signal 

“Burst Count Active with 4th BRDY# for Burst Cycles 
“Address Latch Enable signal 


"AO for Bank 0, RowA 
"AO for Bank 0, Row B 
"AO for Bank 1, RowA 
"AO for Bank 1, Row B 


"State Variable 
“State Variable 
"State Variable 


istype ‘buffer’; 
istype ‘buffer’; 
istype ‘buffer’; 


[Var1, Var2, Var3}; 


fO, 0, 0); 
[O, 0, 1]; 
10, 1, Of; 
13. 4, 15 
iy, 1,08 
it; ty Bh 
11, 0, 1]; 


if RESET then SO with BA := 1; endwith else 
if ((CIPn & IDRAMCSn & !LA4 & !HLDA & !ALE) then S1 with BA := 0; endwith else 
if (1CIPn & IDRAMCSn & LA4 & !HLDA & !ALE) then S2 with BA := 1; endwith 


else SO with BA := 1; endwith; 
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state S1: 
if RESET then SO with BA := 1; endwith else 
if (MRBRDYn # IMWBRDYn) & CYC4X then S3 with BA := 0; endwith else 
if IMRBRDYn & !CYC4X then SO with BA := 1; endwith else 
if IMWBRDYn & !CYC4X then S5 with BA := 0; endwith 
else S1 with BA := 0; endwith; 


state S2: 
if RESET then SO with BA := 1; endwith else 
if (MRBRDYn # IMWBRDYn) & CYC4X then S4 with BA := 1; endwith else 
if IMRBRDYn & ICYC4X then SO with BA := 1; endwith else 
if IMWBRDYn & !CYC4X then S6 with BA := 1; endwith 
else S2 with BA := 1; endwith; 


state S3: 
if RESET then SO with BA := 1; endwith else 
if (MRBRDYn # IMWBRDYn) & !BCNT then S2 with BA := 1; endwith else 
if BCNT then SO with BA := 1; endwith _ 
else S3 with BA := 0; endwith; 


state S4: 
if RESET then SO with BA := 1; endwith else 
if (MRBRDYn # !MWBRDYn) & !BCNT then S1 with BA := 0; endwith else 
if BCNT then SO with BA := 1; endwith 
else S4 with BA := 1; endwith; 


state S5: 
if RESET # ALE then SO with BA := 1; endwith 
else S5 with BA := 0; endwith; 


state S6: 
if RESET # ALE then SO with BA := 1; endwith 
else S6 with BA := 1; endwith; 

equations 


[Var1, BA, Var2, Var3].clk = CLK; 


BOAMAO = (BA &!RASn & !HLDA) #(LA13 & RASn) # (LA4 & !RASn & HLDA); 

BOBMAO = (BA&!RASn & !HLDA) # (LA13 & RASn) # (LA4 & !RASn & HLDA); 

BiAMAO = (BA&!RASn & !HLDA) # (LA13 & RASn) # (LA4 & !RASn & HLDA); 

BiBMAO = (BA&!RASn & !HLDA) # (LA13 & RASn) # (LA4 & !RASn & HLDA); 
end PLD14; | 


241562-61 


PRELIMINARY 3-95 


AP-478 


module PLD15 


title 


PLD15 


X,C,2Z 


‘DRAM CONTROLLER - PLD 15 
INTEL CORPORATION, August 1992' 


Generate NA# input to the CPU for pipelining" 


device 


"Inputs 


CLK 
ADSn 
CiPn 
PIPECYCn 
DRAMCSn 
BCNT 
WRn 
CACHEn 
HITn 
RESET 
HLDA 

CT 

MiOn 
P5BRDYn 


Outputs 


NAn 
Var1 
Var2 
Var3 
CPUIORD 
CPUIOWR 


'E224C'; 


a ae ae A 
pin 2; 
pin 3; 
pin 4: 
pin 5; 
pin 6; 
pin i; 
pin 9; 
pin 10; 
pin 11; 
pin 12; 
pin 13; 
pin 16; 
pin 17; 
pin re 
pin 19; 
pin 20; 
pin 21; 
pin 18; 
pin 23; 
pin 24; 


"State Register Assignments 


state_diagram sreg 


sreg 
So 
S1 
S2 
$3 
S4 


state SO: 
_ if RESET then SO with NAn := 1; endwith else 
if !ADSn & WRn & IPIPECYCn & !HLDA then S3 with NAn := 1; endwith else 
if !'ADSn & WRn & CACHEn & !HITn & !HLDA then S2 with NAn := 0; endwith else 
if (ADSn & !WRn & PIPECYCn & !HLDA) 

# (!ADSn & WRn & HiTn & CACHEn) then S1 with NAn := 1; endwith else 
if (ADSn & IWRn & IPIPECYCn & !HLDA) then S4 with NAn := 1; endwith 


else SO with NAn := 1; endwith; 


State S1: 
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"CPU input clock" 

"CPU ADS# Output" 

"DRAM Cycle in Progress 
“Active for Pipelined Cycles 
“DRAM Chip Select" 

“Burst Count Active with 4th BRDY for Burst Cycles" - 
"CPU Write/Read Indication" 
"CPU CACHE# output" 

"DRAM Page Hit Indicator" 
“CPU Reset Signal" 

“CPU Hold Acknowledge Signal” 
"Cycle Track Output 
"Memory/IO Indicator 

“BRDY# input to the CPU 


"Next Address Input for CPU" 
"State Variable" 
"State Variable" 
"State Variable" 
"I/O Read generated by CPU 
"I/O Write generated by CPU 


(Vari, Var2, Var3); 


[0, 0, O]; 
[0, 0, 1); 
(0S Fey 
0.7, Gr 
My elk 
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if RESET then SO with NAn := 1; endwith else 


if ((CIPn & IDRAMCSn & !WRn & !HiTn) # (WRn & CACHEn & !HITn & !CT) 


# (!CIPn & IDRAMCSn & !WRn & HITn & !PSBRDYn) then S2 
with NAn := 0; endwith 
else S1 with NAn := 1; endwith; 


state S2: 
goto SO with NAn := 1; endwith; 


state S3: 


if RESET then SO with NAn := 1; endwith else 
if BCNT then S1 with NAn := 1; endwith 
else S3 with NAn := 1; endwith; 


state S4: 
if RESET then SO with NAn := 1; endwith else 


if (PIPECYCn & !CIPn & IDRAMCSn & !WRn & !HITn) then S2 with NAn := 0; endwith 
else S4 with NAn := 1; endwith; 


equations 


(Vari, NAn, Var2, Var3, CPUIORD, CPUIOWR).clk = CLK; 


CPUIORD := (!ADSn & !MlOn & !WRn & !HLDA) # (CPUIORD.FB & P5BRDYn); 


CPUIOWR := (!ADSn & IMlOn & WRn & !HLDA) # (CPUIOWR.FB & P5BRDYn); 
end PLD15; 
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module PLD16 


title ‘DRAM CONTROLLER - PLD 16 
INTEL CORPORATION, August 1992' 


Buffers and generates Byte Enables between CPU Memory Bus and EBC Host bus" 


PLD16 device ‘EO600C'; "Implemented with Intel 85CO60 PLD" 
2 t.a = Ming aie alle 
“Inputs 


ISAMEMW pin 3; "ISA DRAM Write Cycle 
EISAMEMW opin 13; "EISA DRAM Write Cycle 
CPUIORD pin 17; “CPU I/O Read Cycle 

HITMn pin ai; “CPU Hit to a Modified Line 
HLDA pin 4; "CPU Hold Acknowledge Output 


" Input/Outputs 


HBE3n pin 5; “Host Bus Byte Enables 0-3 for 32-bit Bus 
HBE2n pin 6; 
HBE1n pin 7; 
HBEOn pin 8; 
HA2 pin 9; “Address Line 2 for 32-bit bus 
BE7n pin 26; "CPU Byte Enable 0-7 for 64-bit Bus 
BE6n pin 20; 
BE5n pin 24; 
BE4n pin 23; 
BE3n pin 22; 
BE2n pin 2 
BEin pin 20; 
BEOn pin 18; 
"Outputs 
HDOEn _ pin 12; “Controls Output Enables on CPU bus side of Data Buffers 
"(EBBs) between 64- & 32-bit bus. 
HAOEn pin 10; "Controls Output Enable of Address Buffers between CPU bus 
“and 32-bit Host Bus 
equations 
IBE7n =. HA2 #!HBESn; 
BE7n.OE = HLDA & HITMn; 
IBE6n = HA2 #!HBE2n; 
BE6n.OE = HLDA & HITMn; 
IBE5n = HA2 #!HBEin; 
BE5n.OE = HLDA & HITMn; 
IBE4n = HA2 #!HBEOn; 
BE4n.OE = HLDA & HITMn; 
IBE3n = !HA2 #!HBESn; 
BE3n.OE = HLDA & HITMn; 
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IBE2n 
BE2n.OE 


IBE1n 
BE1n.OE 


IBEOn 
BEOn.OE 


HA2 
HA2.0E 


IHBE3n 
HBE3n.OE 


IHBE2n 
HBE3n.OE 


IHBE1n 
HBE3n.OE 


IHBEOn 
HBE3n.OE 


IHDOEn 


IHAOEn 


end PLD16; 


PRELIMINARY 


IHA2 # !HBE2n; 
HLDA & HITMn; 


IHA2 # !HBE1n; 
HLDA & HITMn; 


IHA2 # !HBEOn; 
HLDA & HITMn; 


IBE7n # !BE6n # IBE5n # !BE4n; 
IHLDA; | 


(HA2 & !BE7n) # (!HA2 & !BE3n): 
IHLDA; 


(HA2 & IBEGn) # (IHA2 & !BE2n); 
IHLDA: 


(HA2 & IBE5n) # (IHA2 & !BE1n); 
IHLDA; 


(HA2 & !BE4n) # (IHA2 & IBEOn); 
IHLDA: 


(ISAMEMW & !HAOEn) # (EISAMEMW & !HAOEn) 
# (CPUIORD & !HAOEn); 


HITMn; 
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module PLD17 


tile § ‘DRAM CONTROLLER - PLD 17 
INTEL CORPORATION, August 1992" 


. ISA State Tracker - This PLD tracks ISA cycles to DRAM" 

PLD17 device 'E0320'; 

x, Cc, 2 = > 4RY Mer £ 

"Inputs 
CLK pin 1; "CPU Input Clock 
SEMSTR16n__ pin 2; “Signal to indicate 16-bit ISA master (synchronized) 
SMRDCn pin 3; "ISA Memory Read Control Strobe (synchronized) 
SMWTCn pin 4; "ISA Memory Write Control Strobe (synchronized) 
HLDA pin a "CPU Hold Acknowledge Output 
HITMn pin 6; “CPU Hit to a Modified Line Signal 
DRAMCSn pin ¥ "DRAM Chip Select 
DRAMDISn pin 8; "DRAM Disable Signal 
RESET pin 9; “CPU Reset Signal 

" Outputs 


ISAMEMR pin 12: "ISA Read to DRAM 

ISAMEMW pin 13; "ISA Write to DRAM 

CIPn pin 14; "DRAM Cycle in Progress (ISA Cycles) 
NCHRDYn pin 15; "CHRDY active to Hold off ISA bus master 
EADSRiIn pin 16; "EADS to CPU for ISA Read Cycles 


Var1 pin 17; “State Variable 
Var2 pin 18; "State Variable 
Var3 pin 19; "State Variable 


" state register assignments 


sregi = (Vari, Var2, Var3]; 

SO = [0, 0, 0); 

S1 = [0, 0, st 

S2 = IO; 1; 1: 

$3 = (0, 1, O}; 

S4 = be 

ISARD = IDRAMCSn & DRAMDISn & !SMRDCn & HLDA & !SEMSTR16n; 
ISAWR = IDRAMCSn & !SMWTCn & HLDA & !SEMSTR16n; 


state_diagram sreg1 


state SO: 
if RESET then SO with CIPn := 1; NCHRDYn := 1; EADSRiIn := 1; endwith else 
if ((ISARD # ISAWR) then S1 with CIPn := 1; NCHRDYn := 0; EADSRin := 0; endwith 
else SO with CIPn := 1; NCHRDYn := 1; EADSRin := 1; endwith; 


state S1: 
if RESET then SO with CiPn := 1; NCHRDYn := 1; EADSRiIn := 1; endwith 
else S2 with CIPn := 1; NCHRDYn := 0; EADSRIn := 1; endwith; 
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state S2: 
if RESET then SO with CIPn := 1; NCHRDYn := 1; EADSRin := 1; endwith 
else S3 with CIPn := 1; NCHRDYn := 0; EADSRiIn := 1; endwith; 


state S3: 
if RESET then SO with CiPn := 1; NCHRDYn := 1; EADSRiIn := 1; endwith else 
if HITMn then S4 with CiIPn := 0; NCHRDYn := 1; EADSRin := 1; endwith 
else S3 with CIPn := 1; NCHRDYn := 0; EADSRIn := 1; endwith; 


state S4: 
if RESET # SEMSTR16n # (SMRDCn & SMWTCn) then SO 
with CIPn := 1; NCHRDYn := 1; EADSRin := 1; endwith 
else S4 with CiPn := 0; NCHRDYn := 1; EADSRiIn := 1; endwith; 
equations 
[Var1, CIPn, NCHRDYn, EADSRin, Var2, Var3].clk = CLK; 
CIPn.OE = HLDA & ISEMSTR16n & HITMn; 
ISAMEMR = __ISARD: 
ISAMEMW = _— ISAWR; 


end PLD17; 
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module PLD18 


title ‘DRAM CONTROLLER - PLD 18 
INTEL CORPORATION, August 1992' 


‘ EISA State Tracker - This PLD tracks EISA cycles to DRAM" 

PLD18 device ‘P22V10C'; 

1,C,zZ = . Se se 

"Inputs 
CLK pin 2: “CPU Clock Input 
EMSTR16n pin ve "Signal to indicate 16-bit ISA master 
CMDn pin 3; "EISA Command Signal 
STARTn pin 4; "EISA Cycle Start Signal 
MSBURSTn pin 5; "Active for EISA Burst Cycles 
DBCLK pin 6; "Delayed BCLK 
DRAMCSn pin 7; "DRAM Chip Select 
DRAMDISn pin 12; "DRAM Disable Signal 
RESET pin 9; "Reset Signal 
HLDA pin 10; “CPU Hold Acknowledge Signal 
HITMn pin LEE “CPU Hit to a Modified Line Signal 
REFRESHn pin 13; “Active for Refresh Cycles 

" Qutputs 
CIPn pin 18; "DRAM Cycle in Progress (EISA cycles) 
BNKSWTCH pin 19; "Bank Switch for EISA cycles 
EADSREn pin 20; “EADS# to CPU for EISA Read Cycles 
NEXRDYn pin Bi; “"“EXRDY# active to hold off EISA bus master 
Var1 pin 23 istype ‘buffer’; “State Variable 
Var2 pin 24 istype ‘buffer’; "State Variable 
Var3 pin 25 istype ‘buffer’; “State Variable 
Var4 pin 26 istype ‘buffer’; "State Variable 


"state register assignments 


sregi = [Var1, Var2, Var3, Var4]; 
SO = [0, 0, 0, 0); 
S1 = [0, 0, 0, 1]; 
S2 = (0, 0, 4, TT; 
$3 we 10, 1, 44 15 
S4 = 0, 1, 1,1); 
S5 = i, 11,0) 
S6 = [1, 1, 0, 0}; 
S7 = [1, 0, 0, 0]; 
S8 = [0, 1, 0, 0}; 
GDRAMCS' = IDRAMCSn # !REFRESHn; 
state_diagram sreg1 
state SO: 
if RESET then SO 


with CIPn := 1; NEXRDYn := 1; EADSREn := 1; BNKSWTCH := 0; endwith else 
if (DBCLK & GDRAMCS & !STARTn & HLDA & EMSTR16n & DRAMDISn) then S1_ 
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with CIPn := 1; NEXRDYn := 0; EADSREn := 0; BNKSWTCH := 0; endwith 
else SO with CiPn := 1; NEXRDYn := 1; EADSREn := 1; BNKSWTCH := 0; endwith; 


state S1: 
if RESET then SO 


with CiPn := 1; NEXRDYn : 


else S2 with CIPn := 1; NEXRDYn 


state S2: 
if RESET then SO 
with CIPn := 1; NEXRDYn 
else S3 with CIPn := 1; NEXRDYn 


state S3: 

if RESET then SO 

with CIPn := 1; NEXRDYn 
if HLDA & HITMn then S6 

with CIPn := 0; NEXRDYn 
if 1HLDA & HITMn then S4 

with CIPn := 1; NEXRDYn 
else S3 with CIPn := 1; NEXRDYn 


state S4: 
if RESET then SO 
with CIPn := 1; NEXRDYn 
if HLDA & HITMn then S5 


with CIPn := 1; NEXRDYn 
else S4 with CIPn := 1; NEXRDYn 


state S5: 
if RESET then SO 
with CIPn := 1; NEXRDYn 
else S6 with CIPn := 0; NEXRDYn 


state S6: 


= 1; EADSREn := 1; BNKSWTCH 
:= 0; EADSREn := 1; BNKSWTCH 
= 1; EADSREn := 1; BNKSWTCH 


-= 0; EADSREn := 1; BNKSWTCH 


:= 1; EADSREn := 1; BNKSWTCH 
‘= 1; EADSREn := 1; BNKSWTCH 
:= 0; EADSREn := 1; BNKSWTCH 


:= 0; EADSREn := 1; BNKSWTCH 
:= 1; EADSREn := 1; BNKSWTCH 


‘= 0; EADSREn := 1; BNKSWTCH 
:= 0; EADSREn := 1; BNKSWTCH 


‘= 1; EADSREn := 1; BNKSWTCH 
‘= 1; EADSREn := 1; BNKSWTCH 


if RESET # (CMDn & DBCLK & STARTn) then SO 


with CIPn := 1; NEXRDYn 


:= 1; EADSREn := 1; BNKSWTCH 


if !DBCLK & !CMDn & IMSBURSTn then S7 


with CIPn := 0; NEXRDYn 
else S6 with CIPn := 0; NEXRDYn 


state S7: 
if RESET then SO 
with CIPn := 1; NEXRDYn 
if DBCLK then S8 
with CIPn := 0; NEXRDYn 
else S7 with CIPn := 0; NEXRDYn 


state S8: 


:= 1; EADSREn 


‘= 1; EADSREn := 1; BNKSWTCH 
= 1; EADSREn := 1; BNKSWTCH 


‘= 1; EADSREn := 1; BNKSWTCH 


:= 1; BNKSWTCH 
‘= 1; EADSREn := 1; BNKSWTCH 


if RESET # (CMDn & DBCLK) # (MSBURSTn & !DBCLK) then SO 


with CIPn := 1; NEXRDYn 
if !|DBCLK & !MSBURSTn then S7 
with CIPn := 0; NEXRDYn 


:= 1; EADSREn := 1; BNKSWTCH 
:= 1; EADSREn := 1; BNKSWTCH 


:= 0; endwith 
‘= 0; endwith; 


‘= 0; endwith 
= 0; endwith; 


‘= 0; endwith else 
:= 0; endwith else 


:= 0; endwith 
:= 0; endwith; 


:= 0; endwith else 


‘= 0; endwith 
= 0; endwith; 


:= 0; endwith 
:= 0; endwith; 


:= 0; endwith else 


‘= 1; endwith 
:= 0; endwith; 


‘= 0; endwith else 


= 0; endwith 
= 1; endwith; 


‘= 0; endwith else 


‘= 1; endwith 


else S8 with CIPn := 0; NEXRDYn := 1; EADSREn := 1; BNKSWTCH := 0; endwith; 


equations 


[Var1, CiIPn, NEXRDYn, EADSREn, BNKSWTCH, Vare, Var3, Var4].clk = CLK; 
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Sse  [Vart, CIPn, NEXRDYn, EADSREn, BNKSWTCH, Var2, Var3, Var4] AR = RESET: 
CIPn.OE = HLDA & EMSTR16n & HITMn & DRAMDISn; 


ro end PLD18; 
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module PLD19 


title ©§ ‘DRAM CONTROLLER - PLD 19 
INTEL CORPORATION, August 1992' 


: Buffers and generates Data Parity between CPU Memory Bus and EBC Host bus" 
PLD19 device 'E0600C’; “Implemented with Intel 85CO60 PLD" 
X,C,2 = Me G52. 

" Inputs 


ISAMEMR pin 3; "ISA DRAM Read Cycle 
EISAMEMR spin 4; "EISA DRAM Read Cycle 
CPUIOWR pin 12; “CPU I/O Write Cycle 


HITMn pin 13; "CPU Hit to a Modified Line Signal 
HLDA pin 17; "CPU Hold Acknowledge Output 
HA2 pin 27; “Address Line 2 on the 32-bit Host Bus 


" Input/Outputs 


: “Host Data Parity 7-0 for 32-bit Bus 


HDP3 7 pin 5 
HDP2 pin 6; 
HDP1 pin 7: 
HDPO pin 8; 
DP7 pin 26; “CPU Data Parity 7-0 for 64-bit Bus 
DP6 pin 25; 
DP5 pin 24; 
DP4 pin 23; 
DP3 pin 22; 
DP2 pin 21; 
DP1 pin 20; 
DPO pin 18; 
" Outputs 
EDOELn pin 10; “Controls Output Enable for lower 32 bits on Host Bus side of " 
| “data buffers (EBBs)." 
EDOEHn pin 9; “Controls Output Enable for upper 32 bits on Host Bus side of " 
“data buffers (EBBs).” 
equations 
DP7 = HA2 # DP3; 
DP7.0OE ot HLDA & HITMn; 
DP6 = HA2 # DP2; 
DP6.0E = HLDA & HITMn; 
DP5 = HA2 # DP1; 
DP5.0E = HLDA & HITMn; ~ 
DP4 = HA2 # DPO; 
DP4.0E = HLDA & HITMn; 
DP3 = _ |HA2 # DP3: 
DP3.0E o HLDA & HITMn; 
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DP2 
DP2.0E 


DP1 
DP1.0E 


DPO 
DP0O.OE 


HDP3 
HDP3.OE 


HDP2 
HDP2.0E 


HDP1 
HDP1.0E 


HDPO 
HDPO.OE 


!EDOEHn 


!EDOELn 


end PLD19; 
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lHA2 # DP2; 
HLDA & HITMn; 


IHA2 # DP1; 
HLDA & HITMn; 


IHA2 # DPO; 
HLDA & HITMn; 


(HA2 & DP7) # (!HA2 & DP3); 
IHLDA; 
(HA2 & DP6) # (!'HA2 & DP2); 
IHLDA; 


(HA2 & DPS) # (!HA2 & DP1); 
!HLDA; 


(HA2 & DP4) # (!HA2 & DPO); 
IHLDA; 


(ISAMEMR & HITMn & HA2) 
# (EISAMEMR & HITMn & HA2) 
# (CPUIOWR & HITMn & HA2); 
(ISAMEMR & HITMn & !HA2) 


# (EISAMEMR & HITMn & !HA2) 
# (CPUIOWR & HITMn & !HA2); 
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module 


title 


PLD20 
X,C,Z 


" Inputs 


PLD20 


‘DRAM CONTROLLER - PLD 20 
INTEL CORPORATION, August 1992' 


Generates Bank Select during EISA Burst Cycles" 


device 


CLK 
EMSTR16n 
CMDn 
STARTn 
MSBURSTn 
DBCLK 
DRAMCSn 
DRAMDISn 
RESET 
HLDA 
BNKSWTCH 
WRhn 

LA3 

CIPn 
ISAMEMR 


“ Outputs 


BANKSEL 
EISAMEMR 
EISAMEMW 
Var1 

Var2 
PSTROBEn 


'E224C’; 


Puig le Bh 


" state register assignments 


sreg1 
SO 
S1 
S2 


state_diagram sreg1 


PRELIMINARY 


state SO: 


pin 2; 
pin ef: 
pin 3; 
pin 4: 
pin 5; 
pin 6; 
pin 7: 
pin 9; 
pin 10; 
pin i; 
pin 12; 
pin 13; 
pin 16; 
pin bg 
pin 26; 
pin 19; 
pin 20; 
pin 215 
pin 23 
pin 24 
pin 25; 
(Vari, Var2]; 


[0, 0}; 
[O, 1]; 
‘fee 


“CPU Input Clock 

“Active for 16-bit ISA master 
“EISA Command Signal 

"EISA Cycle Start Signal 

“Active for EISA Burst Cycles 
"Delayed BCLK 

"DRAM Chip Select 

"DRAM disable signal 

“CPU Reset Signal 

"CPU Hold Acknowledge Output 
“Bank Switch Output of EISA Cycle Tracker 
“Write/Read Indicator 

“Latched Address line A3 
"Cycle in Progress Signal 

"ISA Read Cycle to DRAM 


“Bank Select for EISA Cycles 
"“EISA Read Cycle to DRAM 
“EISA Write Cycle to DRAM 
istype ‘buffer’; "State Variable 
istype ‘buffer’; “State Variable 
“Parity Strobe for EISA/ISA cycles 


if RESET then SO with BANKSEL := 1; endwith else 

if !CiPn & !LA3 & HLDA then S1 with BANKSEL := 0; endwith else 
if !CiPn & LA3 & HLDA then S2 with BANKSEL := 1; endwith 

else SO with BANKSEL := 1; endwith; 


state S1: 


if RESET # CMDn # (MSBURSTn & CiPn & STARTn) then SO 
with BANKSEL := 1; endwith else 

if BNKSWTCH then S2 with BANKSEL := 1; endwith 

else S1 with BANKSEL := 0; endwith; 
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state S2: 
if RESET # CMDn # (MSBURSTn & CIPn & STARTn) then SO 
with BANKSEL := 1; endwith else 
if BNKSWTCH then S1 with BANKSEL := 0; endwith 
else S2 with BANKSEL := 1; endwith; 


equations 
[Var1, BANKSEL, Var2].clk = CLK; 
BANKSEL.OE = _ HLDA; 


EISAMEMR = _ !DRAMCSn & DRAMDISn & !CMDn & !WRn & HLDA & EMSTR16n; 
EISAMEMW = !DRAMCSn &!CMDn & WRn & HLDA & EMSTR16n; 
IPSTROBEn = ISAMEMR # EISAMEMR; 

end PLD20; 
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module PLD21 


tile | 'DRAM CONTROLLER - PLD 21 
INTEL CORPORATION, August 1992’ 


‘ Generates EADS#, FLUSH# and HOLD to CPU" 

PLD21 device 'E0320'; 

xX, C, 2 = Pisy Hora sles 

" Inputs 
CLK pin 1; "CPU input clock" 
SMCLK pin 2: "Delayed 33-MHz CLK (skewed by 5 ns)" 
EADSRiIn pin 3; "EADS# due to ISA Read Cycles 
EADSREn pin 4; "EADS# due to EISA Read Cycles 
EBCEADSn pin 5; "EADS# signal generated by EBC 
RESET pin 6; "CPU Reset Signal 
SLOWHn pin ye "Slow Hold Function (from Timer in ISP) 
HHOLD pin 8; "CPU Hold Request from EBC 
FLUSH pin 9; "Cache Flush Signal 
HLDA pin 11; "CPU Hold Acknowledge Output 

“ Outputs 
P5EADSn pin 18; "EADS# input to CPU" 
P5HOLD pin 17; "HOLD input to CPU" 
P5FLUSHn pin 16; "FLUSH# input to CPU" 
P5INV pin 19; "INV input to CPU" 
SHOLD pin 15; = 
SFLUSHn pin 14; . 
Var1 pin 13; “State Variable” 
Var2 pin 12; "State Variable" 


“ State Register Assignments 


[Var1, Var2]; 
[0, 0]; 
[0, 1]; 
ri, 1 
[1, 0); 


sreg 
SO 
S1 
S2 
$3 


EADSW = ISMCLK & HLDA & !EBCEADSn; 
state_diagram sreg 


state SO: 
if RESET then SO with SFLUSHn := 1; endwith else 
if FLUSH then S1 with SFLUSHn := 0; endwith 
else SO with SFLUSHn := 1; endwith; 


state S1: 
if RESET then SO with SFLUSHn := 1; endwith 
else S2 with SFLUSHn := 0; endwith; 


state S2: 
goto SO with SFLUSHn := 1; endwith; 
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equations 


[Var1, SFLUSHn, Var2, SHOLD, PS5HOLD).clk = CLK; 
IPSEADSn = !EADSRiIn # IEADSREn # EADSW; 


SHOLD ss := !SLOWHn & !IRESET; 
P5HOLD := HHOLD # SHOLD; 
IPSFLUSHn = !SFLUSHn # SHOLD; 
P5INV = EADSW; 

end PLD21; 
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module PLD22 


tile § 'DRAM CONTROLLER - PLD 22 
INTEL CORPORATION, August 1992' 


2 Generate ADS#, PCHK#, KEN#, and CPUMISS# for System (EBC)" 
PLD22 device ‘E0320’; 
X, C, Zz = gPrsg' tay sllend 
" Inputs 
CLK pin 13 "CPU input clock" 
SMCLK pin 2: "Delayed 33-MHz CLK (skewed by 5 ns)" 
BREQ pin 3; "CPU Bus Request 
HLDA pin 4; “Hold Acknowledge from CPU 
KENn pin a, "KEN# output from Decode logic 
WRhn pin 6; "Write/Read Indicator 
MiOn pin 7; "Memory/lO indicator 
ADSn pin 8; "ADS# from CPU 
PCHKn pin 9; "PCHK# from CPU 
RESET pin 1): "CPU Reset Signal 
" Outputs 
HADSn pin 12; "ADS# regenerated for 33-MHz bus 
HPCHKn pin 13; “"PCHK# regenerated for 33-MHz bus 


CPUMISSn pin 14; "Generated for ISP in order to include CPU in arbitration 
EBCKENn pin 15; "KEN# signal generated for EBC | 


Var1 pin 16; “State Variable 
Var2 pin 1 "State Variable 
Var3 pin 18; “State Variable 
Var4 pin 19; "State Variable 


"State Register Assignments 


(Vari, Var2]; 
[0, 0}; 
(0, 1]; 
(1, 1]; 
[1, O}; 


= 
Huw ow 


[Var3, Var4]; 
[0, 0]; 
(0, 1]; 
(1, 1]; 
[1, 0]; 


state_diagram sreg1 


state SO: 
if RESET then SO with HADSn := 1; endwith else 
if !ADSn & SMCLK then S1 with HADSn := 0; endwith else 
if !ADSn & !SMCLK then S2 with HADSn := 0; endwith 
else SO with HADSn := 1; endwith; 


state S1: 
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if RESET then SO with HADSn := 1; endwith 
else S3 with HADSn := 0; endwith; 


state S2: 


if RESET then SO with HADSn := 1; endwith 
else S1 with HADSn := 0; endwith; 


state S3 : 
goto SO with HADSn := 1; endwith; 


State_diagram sreg2 


state S4: 


if RESET then S4 with HPCHKn := 1; endwith else 

if 1PCHKn & SMCLK then S5 with HPCHKn := 0; endwith else 
if |PCHKn & ISMCLK then S6 with HPCHKn := 0; endwith 
else S4 with HPCHKn := 1; endwith; 


state S5: 


if RESET then S4 with HPCHKn := 1; endwith 
else S7 with HPCHKn := 0; endwith; 


state S6: 


if RESET then S4 with HPCHKn := 1; endwith 
else S5 with HPCHKn := 0; endwith; 


state S7: 
goto S4 with HPCHKn := 1; endwith; 


equations 
(Vari, HADSn, HPCHKon, Var2, Var3, Var4].clk = CLK; 
ICPUMISSn = BREQ&HLDA; 
f1EBCKENn = !KENn &!WRn & MiOn; 

end PLD22; 
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module PLD23 


title ‘DRAM CONTROLLER - PLD 23 
INTEL CORPORATION, August 1992' 


, Combine and genrates BRDY# to CPU and CPURDY# for System (EBC)" 
PLD23 device £0320’ 
x, ¢, Z = Pica hat aes 
" Inputs 
CLK pin : "CPU input clock" 


, 
SMCLK pin Fg “Delayed 33-MHz CLK (skewed by 5 ns)" 

HERDYOn pin 3; “Host Bus Early READY Output from EBC 

LOCRDYn pin 4; “READY output used when programming decode SRAM 
MRBRDYn pin 5; “"BRDY# for CPU Read Cycles 

MWBRDYn pin 6; "BRDY# for CPU Write Cycles 

RESET pin i “CPU Reset Signal 


“ Outputs 


P5BRDYn pin 18; “Combined BRDY# generated for CPU 
CPURDYn pin 16; ‘ 

Var1 ) pin 13; "State Variable 

Var2 pin 12; "State Variable 


" State Register Assignments 


sreg = (Var1, Var2]; 

10) = [0, 0]; 

S1 = [0, 1]; 

S2 = Me il 

S3 = [1, 0); 

HRDY = ISMCLK # (HERDYOn & LOCRDY)n); 


state_diagram sreg 


state SO: 
if RESET then SO with CPURDYn := 1; endwith else 
if |PSBRDYn.FB & SMCLK then S1 with CPURDYn := 0; endwith else 
if |PSBRDYn.FB & ISMCLK then S2 with CPURDYn := 0; endwith 
else SO with CPURDYn := 1; endwith; 


state S1: 
if RESET then SO with CPURDYn := 1; endwith 
else S3 with CPURDY)n := 0; endwith; 


state S2: 
if RESET then SO with CPURDYn := 1; endwith 
else S1 with CPURDYn := 0; endwith; 


state S3: 
goto SO with CPURDYn := 1; endwith; 


equations 
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intel ; ) _ AP-478 
module PLD24 


tile | ‘DRAM CONTROLLER - PLD 24 
INTEL CORPORATION, August 1992" 


Write Enable Generation PLD for the DRAM Array. It also generates the Row Select 
Signal to switch the address multiplexors from row to column address 


PLD24 device 'E224C'; 

x, 6,2 = mT ae i 

4 Inputs 
CLK pin "CPU input clock" 
RESET pin Ya “CPU Reset Signal" 
CIPn pin “DRAM Cycle in Progress 


WRn pin : "Write/Read Indicator 
BANKSEL pin ; “Bank 0 or Bank 1 Access 


2 
2 
3 
DRAMCSn pin 4; "DRAM Chip Select 
5 
6 
WRTPROTn opin ti: "Indicates Write Protected Address 


Won pin 16; “Write Cycle Indication for Bank 0 

Win pin 9; "Write Cycle Indication for Bank 1 

RASAOn pin 10; “RAS# signal for lower half of 512K in Row A 

RASAIn pin +1; “RAS# signal for upper half of 512K in RowA 

RASBOn pin me “"RAS# signal for lower half of 512K in Row B 

RASBin pin 13; “"RAS# signal for upper half of 512K in Row B 
“ Outputs 

WEOAn pin 20; “Write Enable for Bank 0, Row A 

WEOBn pin 21; "Write Enable for Bank 0, Row B 

WE1An pin 23; “Write Enable for Bank 1, RowA 

WE1Bn pin 24; "Write Enable for Bank 1, Row B 

Var1 pin 18; “State Variable 

Var2 pin 19; "State Variable 


ROWSEL1 pin 25; “Row Select to switch address multiplexors — 


state_diagram [Var1] 


state [1]: if RESET then [1] 
else if !CiIPn & IDRAMCSn & WRn & IBANKSEL & WRTPROTn then [0] 
else [1]; 

state [0] : if RESET # WOn then [1] 
else [0]; 


state_diagram [Var2] 


state [1] : if RESET then [1] 
else if !CiIPn & IDRAMCSn & WRn & BANKSEL & WRTPROThn then [0] 
else [1]; 
state [0] : if RESET # Win then [1] 
else [0]; 
equations 
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[Var1, Var2].clk = 


WEOAn 
WEOBn 
WE1An 
WE1Bn 
ROWSEL1 


end PLD24; 
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CLK; 

Var1.FB; 
Var1.FB; 
Var2.FB; 
Var2.FB; 


IRASAOn # !RASA1n # !RASBOn # !RASBI1n; 


241562-82 


PRELIMINARY : 


in a]. | | AP-478 


241562-83 


OTACTOSON 


ee 
« 2 
RGGRGGERGRGE 


A(O02:31) 
BE(O:7)¢@ 


D 
8 > 


RRR TTT ReSfhhidi mu 


Sek 


HU 


$855 35886 


SsOtwo 


This drawing contains information which has not been verified as viable for manufacturing an end user ready product. Intel is not responsible for the misuse of this information. 


PRELIMINARY : 3.117 


BLL-E 


AQYNINMITI 


x-BUS 


LOGI 


HOS 
BUS 


ott —_REZRESH 


82357 (IeP) 


IXDO 6 4, | 

XD (001 07) BXDO7 43 | 
| 29 SRE2 + | 

cso 79 SBES UB SRBE(O:3)4 


il Yay iad Ae RS 
NMI 
CPUMIss#® 
PARITY# 4 


SLOWK 
WRe 


DACK([0:3)#-.DACK(5:7]) # 


MACK(O:5)4@ 


DRQ(O:3)] -DRQ(5:7) 


HBE(O:3)@ 


10K 


Us 


(ERC) 


diddaduddduadudiadad 
: 


SDCPYENOi # 
SDC PYENO2 # 

SDC PYENO3S 

7 2 avi 3 
SDCPYUP 


SDHDLE 


BCLK 


BOsOLES # DATA 


SDHDL& 

2 SDOEO 
SDOEi 4 
SDOKE2# 


HDSDLEi # 
[DO EO 
HDO Ei # 
ERCKE 


BUFFER 
CNTRL 


This drawing contains information which has not been verified as viable for manufacturing an end user ready product. Intel is not responsible for the misuse of this information. 


241562-84 


8Zb-dV 


AP-478 


“UONBWUOJU! SI4} JO BSNSIW OY} JO} BjqiISUOdsel jOU SI }9}U] “ONPOJd Apes, Jesn pus ue HuUUNJOBjnUeW 40) BIGeIA Se PaijJOA USEQ JOU Sey YOIUM UONPWOJUI SUTe}UOD Bulmelp SIU, 


S8-c9Sl v2 


st. 
—< | ME SL at 


@NHOdHN 
a4 saan) ig) Ee 
x Yeaan ty poe —«d|) «sanadoos 
e6N _£ )tacn  e) LU 
st oacan ae #¢ thsraoas 
<a (s*o0laan ee S| *#SONSA42D08 
*3a0 4 ae tONSZAAOaGS 
#ALIUWA & e i.e TON at] 
eLave Lae *# tz IaecH 
os enone 34 ; 
+ -@i ~ a ae a IcHas 
: * pesado 
_— uae Sta Tana CE 
oz0cH 
s omMce 
> Ota oer ZOCaH 
ee lel * em IGHaSst 
#s#Ozoas 
e#txoasa 
[oe ) 
ser CSCC EOE 
focas Oe ae 3 OSES | 
fecas «ee ae sean | 
TAL) LS Pie —s Lean | 
[seas pis ean | 
iszas poe CS EH 
fecas ot] Pw __ > ean | 
CL CLL vec ean | 
fees. _ 89) Liw 27-8 
bil 7-2 SL 5 -: 
foeas __— | [a __—Oean | 
fétas | Poe étan | 
istag 8 Li 
A La [ee evan | 
fstas se] pe __—sa tan | 
istas =| Pe stam | 
jetas SS C.J 2 - | 
fetas ___—>e| sts tam | 
fetas ise] Pot se tan | 
~ ‘trae. ae] Tg a $C 
fords ee [tt _ Otan | 
Co Por é0an | 
fecas. ee Ptr _20cn | 
Pa i J {oe SCO | 
fs0as  __—*Es| EI - E 
fsogs (v__Soan | 
iyode 5 eS Ca -- 
fscaus 8] Ca xX -- 
ftoas = fe" ___toan | 
ooas oodH 
{ter:oo0las (@@m)>) escere (te oo)am 


en] 


Tie s Stau 
etc 
oowve. 
x Sianas 
76 
. 
<< | ry aetie 
ca 
corve. 
ad @#cataMus 
een 
soeve. 
< yt wit 
een 


3-119 


PRELIMINARY 


AP-478 _ Fi intel. 


241562-86 


HD[(CO:31) 


FL FEFEEFEREEEEFERETEEREEEEFEEFEEEE 
olf 
21333 ETETEEEEEEEEEEEE SPRETEePETPr ET EEEHEEECEELLE 


a FETE EEFEEFEEFEEEEEEREEREEEEE i | 


i 
3{t 
13 


RETEETEEREREEEEEEEEEEEEEEEEE HEHEHE 4423 HEA, 


3-120 PRELIMINARY 


AP-478 


3-121 


L8-29SLv2 
49 
240084 Sj 
24008 94a 
ae : 
3° = od 
ONZO 
eOUun oadoa 


Ceo*eclanxoce 


8S canoe 


OS 40 OB OS OE CU OES 
eve 

eves 

9094e404 


Ces'eela 


(te'00I3q 


PRELIMINARY 


AP-478 


88-c9Sl ve 


4£i£n 


sen 


(se'eeclanxta 


tes'esia 


(ts coolants 


eviv ov cvevev eves 


cn 
. iY iY te \s 5 * 


{teroola 


PRELIMINARY 


3-122 


ASV NIMITZ) 


Ecl-E 


A(O3:23] 


ROWS ELI 
REFR 


v4e2 
nm 20 
B. 


BE(O.7})4 


74ar244 U52 RP10 
i « La -yv vy Mubd 
i mers ogee eer ren OE ss 
B rvs cv IR ed BS 
— a I PrN 
° i. ae bet Ad az 
== sae ion [ae 2 
ee ey lm: a 
ae 1 REF ees oe Mw 
ee aman. es EE aE ik uUs6 =e 10 OM 
oe eee a a MONE a A A =) — 
Pees a i ee he ees OT 
a ee a a eer ee 
a a Mn Fe tio EB ! ped se aa. 7éynes 
aa Zener 
PER 
 memgee 
les oF uss ae 
; TTL. ww 
oa Bee ee eee ee Fr AR 
ee a a ee nS we eR oa a WA IED 
a ss Ei a Bakes aT ; Aitken 
et a A ee ee, he es 
a) ae ae a Pe be ee vse WW 
ae ae foe a eT ay, 
en A ae a fat. Te ee? Cs pep 10 ONM 
at ee ee EW SRE oe 
oom wed 
Ee oo 
Pepi. .< ad 
<—s ae eee 
: BERN HRED RI 28k ee ww 
Sarr ROM AO mr om arn nie eS core eg A OY UC DAMN 6 
ee nn SGS00 See ee eee eee ae ne Peres 
penne gee anpen TTI Titles) Wee 
mee py TLL. tat ra oa ; Maine 
SOP SRE ol RTD 
a ORE Bo WW 
eae rer) Tirlt sai 2s - 
PWR vei hasaae 
“Jos Xes** ars rt 4a | 210 OM 
6 Fey 
ES fines 
: isa 
ROWSEL : 


BEOS 


2 aa] CT 
ues PETS: 
=e ae | ener ar 


TULL 
| oo 
ELEEEEE 
i : 
Ha 


2 
BOBMAi 
BO. 2 
P-T-} 

BOAMAS 
BORMAS 
2 O. 4 
BOBMA4 


BiAMAS 
Bik Ss 
3B [AS 
BIiBMAS 
BiAMA7 
BiRMA? 
Bi. s 
BiBMAS 


This drawing contains information which has not been verified as viable for manufacturing an end user ready product. Intel is not responsible for the misuse of this information. 


241562-89 


8Lb-dV 


o 
i- 


“UONEWUOJUI! SIU} JO BSNSiw ey} JO} ejqisuodse OU SI je}U; ‘YONPOd Apes sesn pue ue BuUNOeynUeW JO} BIGeIA SB PEYeA USE }OU Sey YOIUM UONBWIOJUI SUIe}UOCD BuIMeup Siu] 
06-c29Sl 72 


sen 


<4q0a 
ao 


4q08 


eax 
eee 
oer 
vex 
on 
cewe 
eseve 
eteve 
eoeve 
ecove 
etevs 
eseve 
eoave 


Pa]. cna 


VIWWTZVTI i 


a 


a 


s 3 
eta 
obe 
ota 
ada 
ote 

te 
ede 

ba 


? 


eez 
cez 
vez 
tes 
en 
@eevo 
@egeve 
eteve 
eosve 
¢cove 
ewvs 
Cove 
oove 


agzzzazay F222 TERR s aaas 


PVVIITIG7 


SAa-TtA wos 
SCXATITS zs Za AWM SRP INCOK MU 
tog MoE PEXNETTS WO SEXHOSSS WEMIIZ *BLON 


(es ocolamoa 


AP-478 


PRELIMINARY | 


3-124 


intel.  AP-478 


241562-91 


gq@aneganaereeocdanenaereeotheanenevreaacda 


PG UHHD sna 


VIIWZTIT2 


ROoOWwBii 
2 


BiBxAC 
,] 


BiBMA[i:& 


ele 
1a 
mF: iaisist & 


eee 


eodanneaereeod 


aeaeanannmpaunaeananan 


e@eee @#ee208 


aaazeczar L222 2222, gags 


aR 


BimMp(0O:¢€3) 
This drawing contains information which has not been verified as viable for manufacturing an end user ready product. Intel is not responsible for the misuse of this information. 


2 PRELIMINARY | 3-125 


AP-478 


"UONPWUOjU! Siu] JO esNsiw au JO} ajqisuodsei }OU SI je}Uj “}ONPOId Apes sesn pue ue BuNjoejnueW 10) ajqeIA Se PEeYEA USE JOU Sey YOIYM UONeUWWOJUI SUIe}UOD Buimeup SIU, 


26-c9Sl¢e 


eCte*vciwa 


Cét'colys 


| 


e(tererierz 


(¢e* solves 


7 Stau 
[—_étva tt) hac: otau a 
eo = 
ae SY e etc 
ae 2 NF ie 
ee ik ( , 
pS tvs oe) Heeb 
—etwe #5) re 
- kena 

ia I baal 5 Oat 
[—_‘ttve _ te} 
a-ha : — 
Neem em ne 
oe ery 
__ cove ver! = 
Y___ vows  _c¥F] *- pa Tee 
ics COUN + <Pe Overt 

zove 
Y_etews 6] eo ter 
P_eeews__*ty *- a Xs 
Y_eeeea tt Ls i a 
1 ie CL x3 ae 
Pp eeeea pve sewn 

eocw’ : 

oF r——soEne | 
2. eee 
__sewy_¥F7 Lk: 
ee: io se 4 
Otero) Lie eS 
SESE 2: reste 
ee A i ee 
aren . cen] 
sD ve ————"s tw 
we — | 
wee eel i oi | 
RR 5c a eee 
SR i OL ee ——"ztt 
re ble b - 
PER a: LL <. = iOpen | 
sows a) pte "sown 
a oso | 
a ove — 2. | 
Y__eowa ty "orn | 
ae aoe 


tte+eol] we 


PRELIMINARY 


3-126 


ABVYNIMI Tad 


Lob-€ 


A(1i3 +32) 


MIOS 


Dl00:03)} 


R43 RIS RES 
20a. 20m Ls 


va7 ve7 


boo 

pos a! 
[Doz ¢| 
a 


DRAMDIS# 


(Oa a ae a WR p 5 
eT LS Oe ee [a2 | 
RESET 
. é 747244 
D cse 
ee RE, TS 


[see 2 nee aes SS eee RI PROT = 


nwesc2a20 


v6¢é 


DRAMCSi# 
WROe 
WRiI® 

ret 


This drawing contains information which has not been verified as viable for manufacturing an end user ready product. Intel is not responsible for the misuse of this information. 


241562-93 


8Lb-dV 


S| 
7s ae 


AP-478 


v6-c9Slve 


= 
= 
=z 
7 
= 
= 
/s 


Zeocen 
s 


io 
~ 


eMLIMN 


. 
c¥s* sole (ts'solwn 


PRELIMINARY 


3-128 


ABV NIMITZ) 


6cl-€ 


Nesc22vio 
Ne5C220 
NBS5C224 


tf == 


IE 


nNesc2a24 


nNe5C224 


ali ers 
161i ere 


° 
d 
> 
« 
a 
v 
0 
© 
Z 


cLKS 


nNesca24é 


241562-95 


8Lb-dV 


AP-478 aes intel. 


241562-96 


. ° . 
1 i 
o| a : 
a a D 
* § 
nace 9 " z “ 
a ad d a i} ry 
® % * ° a a 
a a a a la 


PLDI¢€ 


MRBRDY# 
crPe 
c) 
LDAS 
HITS 


s 
a 4 
a 


3-130 PRELIMINARY 


AP-479 


APPLICATION 
NOTE 


Pentium! Processor 
Clock Design 


DERRICK LIN 
JIM REILLY 


November 1993 


Order Number: 241574-001 3-131 


PRELIMINARY 


Pentium™ Processor Clock Design 


CONTENTS PAGE 
1.0 INTRODUCTION ................... 3-134 
1.1 General Clocking Issues ......... 3-134 


2.0 Pentium™ PROCESSOR, 82496 
AND 82491 SYSTEM CLOCK 
SPECIFICATIONS ................... 3-135 


3.0 AVAILABLE CLOCK DRIVERS .... 3-140 


4.0 CLOCK GENERATION FOR THE 
Pentium™ PROCESSOR AND THE 


CPU-CACHE CHIP SET ............. 3-144 
4.1 Clock Generation for Fully 
Synchronous Systems ............ 3-145 
4.2 Clock Generation for Divided 
Synchronous Systems ............ 3-145 
4.3 Clock Generation for 
Asynchronous Systems ........... 3-149 


5.0 Pentium™ PROCESSOR WITH 
256K 82496/82491 SECOND LEVEL 
CACHE CLOCK DISTRIBUTION 


DESIGN EXAMPLES ................ 3-149 
5.1 Clock Routing for the 256K CPU- 
Cache GRID S6t .. on. cc cess veecesus 3-149 
5.2 Analysis of Drivers Used in 
Se nn oo a aa ee 3-155 


6.0 Pentium™ PROCESSOR WITH 
512K 82496/82491 SECOND LEVEL 
CACHE CLOCK DISTRIBUTION 
Ps ois 3 clive noukeuhck. faceeuars 3-165 


7.0 CLOCK DISTRIBUTION FOR THE 
Pentium™ PROCESSOR WITH 
OTHER SECOND LEVEL CACHES .. 3-165 


OD BURMA bats ciedelisevene seashore’ 3-165 
BO FECEMENGES 6 vch 6s dene dsr stevens 3-165 | 
APPENDIX A. CLOCK DRIVER 

MANUFACTURERS ................. 3-166 
3-132 


CONTENTS 


FIGURES 
Figure 1 


PAGE 


Common Termination 
TQCTTIUIOS. oot ne Reees anat ad: 3-135 


Clock Requirements for the 
Pentium™ Processor and 
CPU-Cache Chip Set ........ 3-137 


An Example of an Acceptable 

Clock Waveform (Diodes Are 
Absent from the Input 

Model) .......... Ce Pee 3-138 


An Example of an Acceptable 
Clock Waveform (Diodes Are 
Present in the Input Model) .. 3-139 


An Example of an 

Unacceptable Clock Waveform 
(Diodes Are Absent from the 

eli gue || ar 3-140 


A CPU Module with the 

Pentium™ Processor, 82496 

and 82491 CPU-Cache Chip 

Me SD attak ccs pun ae ia ele 3-144 


Examples of Clock 
Generauon «2... ésiweds «chew tt 3-145 


Clock Generation Using Clock 
GRE Sc ha aaa cies (emcees 3-146 


Clock Generation Using Clock 
PORTE . wp dieaaps SUA h Gea eons 3-146 


Figure 10 Clock Generation Using Clock 
GPURG . Fu veh ew aba apeky dese 3-147 


Figure 11 Clock Generation Using Two 
PLES Aastha nel ele cleus 3-147 


Figure 12 Clock Generation Using Two 
cg hl? Yotery oat So eee 3-148 


Figure 13 Pentium™ Processor, 82496 
and 82491 Clock Input 


Figure 2 


Figure 3 


Figure 4 


Figure 5 


Figure 6 


Figure 7 
Figure 8 


Figure 9 


REI E. os ccwes ns cu hbase dads 3-150 
Figure 14 CLKO Layout for 256K Chip Set 

WI ON gas 0s $ Rsro os 0nd 3-151 
Figure 15 CLK1 Layout for 256K Chip Set 

WE wk han aint ats 3-152 
Figure 16 CLK2 Layout for 256K Chip Set 

WME fc vcbvecdehiescexas 3-153 | 


CONTENTS 


FIGURES 


Pentium™ Processor Clock Design 


PAGE 


Figure 17 CLK3 Layout for 256K Chip Set 


Figure 18 
Figure 19 
Figure 20 
Figure 21 


Figure 22 
Figure 23 
Figure 24 


with Parity 
Motorola Waveform 
National Waveform 


Vitesse (Slow) Waveform .... 3- 


Vitesse (Slow) Waveform 
(Continued) 


Vitesse (Fast) Waveform 
Triquint Waveform 
Triquint Waveform (Contd.) .. 


3-154 


CONTENTS PAGE 
TABLES 
Table 1 Clock Signal Quality 

Specifications ................. 3-136 
Table 2 Clock Signal Quality 

CUNGONTIOS 5c. c.c.s-c oy 0s goes anak 3-136 
Table 3 Clock Driver Options ........... 3-141 
Table 4 List of Clock Doubler Parts ..... 3-148 
Table 5 List of Clock Divider Parts ...... 3-148 
Table 6 Interconnect Characteristics ... 3-155 
Table 7 Compilation of Simulation 

LgRAR Ash tok) ahead. dle See 3-156 


Table 8 


Series Termination Resistor 
Values for Each Line 


3-133 


AP-479 


1.0 INTRODUCTION 


Today’s high speed microprocessors place a heavy de- 
mand on clock generation and distribution. To main- 
tain a synchronous system, well-controlled and precise 
clocking solutions are required. Pentium™ processor, 
with operating frequencies of 60 MHz and 66 MHz, has 
tight system clock specifications. In order to bring 
clock signals of acceptable quality and minimal skew to 
the Pentium processor and the rest of the system, sys- 
tem designers have to contend with high speed issues 
for clock distribution and limited number of precise 
clock driver devices. In this application note, the key 
issues in the design of a 60 MHz or 66 MHz clock for a 
Pentium processor-based system will be discussed, 
available clock drivers will be listed and discussed, and 
detailed design examples of a clock solution for the 
Pentium processor with 256K second-level cache sub- 
system, using the 82496 Cache Controller and the 
82491 Cache SRAMs, are provided. 


The Pentium processor, 82496 Cache Controller, and 
82491 Cache SRAM form a CPU-Cache core or chip 
set. Along with a memory bus controller (MBC), the 
chip set provides a CPU-like interface for many types 
of memory buses. 


This application note is intended for system designers 
concerned with clock generation and distribution for 
the Pentium processor and CPU-Cache chip set based 
systems. It reflects data collected from several quarters 
of characterization of the Pentium processor and expe- 
rience with some of the clock driver devices, as well. 
This application note gives readers a good understand- 
ing of the issues and solutions of high speed clocking, 
particularly that for the Pentium processor. The reader 
should be familiar with the Pentium processor and 
CPU-Cache chip set electrical and mechanical specifi- 
cations, Clock Design in 50 MHz Intel486™ Systems, 
and transmission line theory. If not, please read materi- 
als listed in Section 9.0 before proceeding. 
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1.1 General Clocking Issues 


There are two major problems with distributing clock 
signals at 66 MHz: clock signal quality and clock skew. 
At high speed, one set of effects which has been minor 
in slower designs is now significant—the effects of 
transmission line. At high frequencies and fast edge 
rates, long traces behave like transmission lines. The 
“lumped” circuit assumption which assumes instanta- 
neous signal transmission is no longer valid. Instead, 
signals travel in a finite time. When a transmission line 
is not properly terminated, one can observe severe over- _ 
shoot, undershoot and ringback, all of which degrade 
logical signals. Bad signal quality can cause false 
switching or multiple switching, and can in extreme 
cases damage the devices. To maintain a clean clock 
signal, designers must consider clock driver characteris- 
tics, signal routing, load characteristics, and transmis- 
sion line termination. . 


There are four basic ways to terminate a transmission 
line, series, parallel, Thevenin, and AC terminations 
(Figure 1). Series termination is recommended when 
driver output impedance is less than the transmission 
line characteristic impedance (true for most TTL driv- 
ers) and the line is driving a small number of devices. 
Series termination consumes low power and uses only 
one device; however, the termination method increases 
signal rise and fall times. Series termination ensures 
good signal quality by eliminating secondary reflection 
off the driver end. The rest of the termination methods 
eliminate reflection at the load end. All of the termina- 
tion methods can provide good, clean clock signals at 
the load. Both parallel and Thevenin terminations con- 
sume a large amount of power. Thevenin termination 
consumes less power than parallel but requires one 
more device. AC termination consumes low power but 
adds capacitive load to the driver and delay due to RC 
time constant. Design examples provided with this ap- 
plication note use series termination. For more infor- 
mation on transmission line effects and design issues, 
please refer to [ref. 3, ref. 4, ref. 5] 
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Figure 1. Common Termination Techniques 


Skew is defined as the time difference between when the 
clock signal reaches each component. As frequency in- 
creases, there is less and less time for computation in a 
given clock period for a synchronous design. For a typi- 
cal design, the time from one rising edge to the next is 
composed of the largest path-delay, setup time, propa- 
gational delay through logic elements, and skew. Clock 
skew then, takes away from the time available for prop- 
agational delay, thereby restricting the amount of logic 
done in a clock cycle. For high speed designs, skew 
must be minimized. 


To minimize skew, designers must tune clock traces so 
that the propagational delay from driver through each 
trace to load is the same for each load. For balanced 
loads, tuned traces have same lengths. For unbalanced 
loads, trace lengths can be adjusted to make up for 
loading differences. If possible, designers should try to 
keep the loading on each clock line the same. 
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2.0 Pentium™ PROCESSOR, 82496 
AND 82491 SYSTEM CLOCK | 
SPECIFICATIONS 


System clock specifications can be divided into 2 cate- 
gories: signal quality requirements and skew specifica- 
tions. Clock signal quality requirements are the same 
for the Pentium processor and CPU-Cache chip set. 
Skew specifications are only required for CPU-Cache 
chip set. 


Signal quality requirements define boundaries for ac- 
ceptable signal shapes and levels. There are two parts to 
signal quality requirements: signal quality specifications 
(Table 1) and guidelines (Table 2). Please refer to the 
latest revision of the Pentium processor and CPU- 
Cache chip set specifications for more details and for 
the most up-to-date information. 
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Table 1. Clock Signal Quality Specifications 


Be ir one () 

a See fea ie ace 
eee Ae 
pth | CLK Rise Time 
pte | CLK Fall Time 
Rte SR aes 

Ua eee (ne ra 

NOTES: 


. Below 66 MHz only functionality is guaranteed. 

. High times are measured between 2.0V crossing points. 

Low times are measured between 0.8V crossing points. 

. Rise and fall times are measured between 0.8V and 2.0V. 

. Symbols in Figure 2. 

. Functionality is guaranteed by design/characterization. 

. Measured on rising edge of adjacent CLKs at 1.5V. 

. To ensure a 1:1 relationship between the amplitude of the input jitter and the internal and external clocks, the jitter 
frequency spectrum should not have any power spectrum peaking between 500 KHz and 1/3 of the CLK operating frequen- 
cy. 

9. The amount of jitter present must be accounted for as a component of CLK skew between devices. 


(6), (7), (8), (9) 
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Table 2. Clock Signal Quality Guidelines 


V 
Saad. "AG RR Ae MT 


NOTES: 

1. Overshoot (undershoot) is the absolute value of the maximum voltage above Vcc (or below Vgs). The guideline assumes 
the absence of diodes on the input. 

2. Ringback is the absolute value of the maximum voltage at the receiving pin below Vcc (or above Vss) relative to Voc (or 
Vss) level after the signal has reached its maximum voltage level. The input diodes are assumed present. 


The overshoot guideline should be used in simulations, 
without diodes present, to ensure overshoot (under- 
shoot) is within the acceptable range. The ringback 
guideline is provided for verification in an actual sys- 
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tem. System designers do not have to worry about ring- 
back if the signal does not overshoot or undershoot, 
respectively. Figure 2 summarizes clock waveform re- 
quirements listed in Table 1. 
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Figure 2. Clock Requirements for the Pentium™ Processor and CPU-Cache Chip Set 


Figure 3 to Figure 5 illustrates examples of acceptable 
and unacceptable clock waveforms. Waveform in Fig- 
ure 3 is for an input model without diodes. Waveform 
in Figure 4 is for an input model with diodes. The di- 
odes clamp the voltage and prevent it from going more 
than a diode drop above Vcc or below Vss. Waveform 
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in Figure 5 is for an input model without diodes. The 
waveform is not acceptable for several reasons. It vio- 
lates the minimum low time specification (4 ns), the 
maximum fall time specification (1.5 ns), and it does 
not follow the maximum undershoot guideline (1.6V). 
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Figure 3. An Example of an Acceptable Clock Waveform (Diodes are Absent from the Input Model) 
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Figure 4. An Example of an Acceptable Clock Waveform (Diodes are Present in the Input Model) 
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Figure 5. An Example of an Unacceptable Clock Waveform (Diodes Are Absent from the Input Model) 


Clock skews for the CPU-Cache chip set are measured 
at 0.8V, 1.5V, and 2.0V on the rising edge. Worst case 
skew between the Pentium processor and the 82496 is 
0.2 ns, and worst case skew between any 82491 and 
either the Pentium processor or the 82496 is 0.7 ns. 


3.0 AVAILABLE CLOCK DRIVERS 


Intel has held discussions with many clock driver com- 
ponent companies. The intent has been to enable these 
companies to offer clock driver solutions that meet the 
Pentium processor specifications. It has also been to 
ensure that the super set of these companies can pro- 
vide support and distribution worldwide on a schedule 
that closely matches the Pentium processor’s availabil- 
ity. Based on information available, Table 3 lists a num- 


ber of companies who are planning to offer solutions to . 


meet these requirements. All the clock drivers listed in 
Table 3 have maximum output frequency equal or 
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above 66 MHz. Preliminary data sheets show that solu- 
tions listed in Table 3 meet the CPU or CPU-Cache 
chip set requirements. The specifications listed are 
based on preliminary data provided by each company 
and may be subject to change. Designers should contact 
each company for the latest specifications and availabil- 
ity. Some evaluation has been done by simulating an 


‘example clock layout using output models supplied by a 


subset of the companies listed, along with interconnect 
models and preliminary clock input model of the CPU- 
Cache chip set. For more detail on the simulations and 
example routing, please see Section 5.0. Intel has been 
and will be working closely with the listed companies to 
ensure they have the latest specifications for the Penti- 
um processor. With published preliminary data sheets, 
all the listed parts meet either the CPU or the chip set 
clock specifications (including the signal quality and 
skew specifications). Please contact individual manu- 
facturers for data sheets and sample availability. 


PRELIMINARY 


Table 3. Clock Driver Options 


t,/ ty 


(0.8V-2.0V) secre 
(ns) 


Stability 


# of Outputs 
(per pkg) 


Spec’d Max. 
Loading Freq. 


Driver Level 
Type In/Out 
ee cee on 


AMCC SC35XX-1 Buffer PECL or 
TIKET TE 
soured mur 


ASYNIMI Tad 


ae at 
1.5/1.5 20 Outputs 
Which Vary 
with Part # 
p10 | 155 |__| 6-120utputs__ | a5pF_— | 80MHz 
AT&T (7) DA400 PLL PECL or £.5/1.5 8@1, 0.5X 50 pF 
TTL/TTL (Prog Shift) 
Cypress CY7B991 TTL/TTL 1.2 (6) 1.5/1.5 0.5% 4@1X 502/ 80 MHz 
4@1, 0.5, 0.25X 30 pF 


PLL 


85C224-100 Divider/ TTL/CMOS Divider 0.4 NA Divider 700/ 
Buffer/PLD Buffer 0.5 1.2/1.1 50 pF 
Buffer 
1.4/1.1 
85C224-10 Buffer/ TTL/CMOS Divider 0.4 NA Divider 700/ 
Divider/PLD Buffer 0.5 1s T1 50 pF 
Buffer 
1.4/1.1 


Motorola MC10H646 PECL or :2/1:2 NA 500/ 
THAT A. 50 pF 
: ae oe 7 : 


Divider 
100/50 
Buffer 133 


Divider 
58/29 
Buffer 100 


NA 2.5/2.5(11) A 5@1X 5 66 MHz 
1@2X 
1@.5X 
1 Inverted X 


LV L-€ 


62b-dV 


= 


CV L-E 
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Table 3. Clock Driver Options (Continued) 


62b-dV 


# of Outputs Spec’d 
(per pkg) Loading 


12@1, 0.5, 2X 
(Prog Shift) 


Not 
Spec’d. “ 


5@1X 
4@0.5X 
2@0.5X (Prog 
Shift) 


GA1086 


GA1087 


2@1X 50 pF ~ 70 MHz 
6@1, 2, 4x 


NOTES: 

1. 0.7 ns between Pentium processor-82491, 82496-82491, 82491-82491. 0.2 ns between Pentium processor-82496. Assumed 0.5 ns between clock driver outputs, leaving 0.2 
ns for routing or trace skew. 

2. See complete specification in Table 1 or the data book. 

3. Manufacturers listed in alphabetical order. . 

4. Contact manufacturers for price and availability information. 

5. Intel does not guarantee specifications for other manufacturer’s devices. All clock driver specifications listed were provided by the manufacturer and are subject to change. 
Designers should contact the manufacturer for the latest specification/data sheet information. 


6. As low as 0.75 ns in some configurations. = 8 
7. First samples in March ’93. Specifications may improve during characterization. > 
8. Other Solutions are under development. Contact TI for preliminary details. 

9. Maximum phase erro quoted in the manufacturer’s data sheet for the entire frequency range. 

10. Other configurations available. Contact Triquint for details. 


11. Between 0.2 Voc and 0.8 Voc. Contact Motorola for details between 0.8 and 2.0V. 
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AMCC offers the SC35XX-1 series of buffered clock 
drivers and the SC44XX-80 series of PLL based clock 
drivers. The SC35XX-1 series must be driven with a 
TTL or PECL 2X frequency input. Each member of 
the series provides 20 outputs. Depending on the specif- 
ic part within the series, these 20 outputs can be config- 
ured to provide the primary frequency, 1/2, or 1/4 the 
primary frequency. The SC3502-1 even provides 5 in- 
verted outputs of the primary frequency. The SC44XX- 
80 series must be driven with a TTL input. The PLL 
design allows for very low skew (+ 200 ps) between the 
outputs.Different members of the series offer different 
numbers and configurations of outputs. Between 4 and 
8 outputs are available at the primary frequency. These 
devices also allow a subset of the outputs to be config- 
ured for 1/2X or 2X the primary frequency. In addi- 
tion, the PLL allows the outputs to be skewed in phase 
from one another. 


AT&T DA400 is a PLL clock driver. Its inputs can be 
driven by TTL or PECL levels. Eight outputs are pro- 
vided. They can be configured for the primary frequen- 
cy or 1/2X the primary frequency. In addition each 
output has a programmable delay line which allows 
1/32 or 1/64 increments of the clock period of delay 
between outputs. 


Cypress’s CY7B991 is a PLL clock driver. It requires a 
TTL input and is able to drive 8 outputs. A subset of 
the outputs can be configured as 1/2X, 1/4X, or invert- 
ed outputs. As with other PLL solutions, the skew 
between outputs is small and the outputs can be config- 
ured for a fixed amount of delay or skew between out- 
puts. 


ICS’s ICS2686 is a PLL clock driver. Five outputs are 
available. Both primary and 1/2X frequencies are avail- 
able. The ICS2686 has been designed to work with the 
74ABT240 type buffer to provide more than 5 outputs. 
A unique feature of the ICS2686 is the multiple feed- 
back inputs. This feature allows synchronizing multiple 
outputs at their destination or load with the input 
clock. 


Intel’s 85C224-100 is a “20V8” architecture program- 
mable logic device. From its TTL inputs it provides 8 
TTL outputs which can be configured to provide 1X, 
1X inverted, and 1/2X versions of the primary frequen- 
cy, in any combination. When programmed to function 
as a frequency divider, the primary frequency can be as 
high as 100 MHz and the 1/2X frequency outputs will 
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maintain output skew below 400 ps. When pro- 
grammed to operate as a straight 1X buffer, it supports 
frequencies of up to 133 MHz with less than 500 ps of 
output skew. The 85C224-100 provides a combination 
of superior output signal quality including fast rise and 
fall times and low output skew. A particularly unique 
feature of the 85C224-100 is in its programmable logic 
circuitry. Its flexibility satisfies programmable logic 
needs such as control line signals and widespread glue 
logic. With this minimized output skew PLD, a single 
28-pin PLCC can provide low output skew clock distri- 
bution, frequency division, and programmable logic; for 
the low price of a 20V8 PLD. 


Motorola offers both a buffered and a PLL clock solu- 
tion. Motorola’s 10H646 is a buffered clock driver. It 
offers both TTL and ECL inputs which supports back- 
plane routing using ECL levels. The clock driver’s out- 
puts are clamped to 3V, not Vcc. 10H646’s output 
stage has similar rise and fall output resistances. Similar 
rise and fall output resistances makes series termination 
easier since the termination resistance is the difference 
between the characteristic impedance of the transmis- 
sion line connecting the output to the load and the driv- 
er’s Output impedance. 10H646 has 8 1x outputs. As a 
straight buffer, 10H646 does not offer any multiples of 
the input besides 1x. The Motorola 88915 is a PLL 
clock driver. It provides a 0.5 ns skew between outputs. 
The 88915 provides 5 1X outputs along with 1 2X, 
1 0.5X, and 1 inverted X outputs. 


National’s clock buffers are packaged to function reli- 
ably at high frequencies. Their output rise and fall re- 
sistances are approximately equal. The CGS74CT2524 
and 2527 provide 0.45 ns of output skew. The 
CGS74CT2528’s output skew, 0.55 ns, allows for only 
0.15 ns skew due to board traces or any unbalanced 
loading effects when using the 82496/82491 cache, 
however this amount may be sufficient for other cache 
solutions. These parts offer a range of 4, 8, and 10 out- 
puts. The CGS74CT2524 and 2527 have CMOS level 
outputs, which transition from rail to rail. 


Pioneer’s PI6B2407 is a PLL clock driver. From its 
TTL input, it provides twelve TTL outputs, which can 
be configured to operate at 1X or 2X the input frequen- 
cy. In addition, the outputs can be phase adjusted from 
the input clock..The PI6B2407 is able to provide 
+0.25 ns of skew between outputs while maintaining 
the fast 1.5 ns rise and fall times. 
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Texas Instruments’ ABT328 driver provides six outputs 
with an output skew of 0.7 ns. Please contact Texas 
Instruments for the availability of 0.5 ns output skew 
parts. 0.7 ns output skew is too large for the chip set 
application. In the design example on Section 5.0, 0.5 
ns output skew is assumed. As a buffered driver, the 
ABT328 offers only 1x outputs. 


TriQuint’s GA1086 is a Gallium Arsenide-based prod- 
uct. It takes a 66 MHz input and produces nine 
66 MHz outputs and one 33 MHz output. The avail- 
ability of a low skew 33 MHz output facilitates clock 
distribution for systems that have synchronous 33 MHz 
memory buses. Since the part is phase-lock-loop based, 
one of the outputs can be fed back to the input so that 
all the outputs are synchronized with the input clock. 
Such a set up is ideal for cascading clock drivers to 
achieve maximum fanout. The specified output skew of 
the GA 1086 is 0.25 ps, the smallest skew number avail- 
able. Triquint also offers the GA1085 and GA1087. 
These products are similar to the GA1086, however, 
they offer different combinations of outputs between 
1X and 0.5X. 


Vitesse’s VSL4485 is also a Gallium Arsenide-based 


product. It offers 1x, 2x, and 4x options on two of its 
eight outputs. Thus, to obtain both 33 MHz and 


Processor Card 


Clock out to 
components 


| 
| 
| 
| 
| 
| 
| 
| 


CLK Driver(s) 


ee 
Clock In 


a 4 

intel. 
66 MHz signals with low skew, for example, the clock 
input frequency of the VSL4485 can be 33 MHz. For 
the chip set application, two 66 MHz outputs are not 
enough, and thus cascading another driver is necessary. 
Alternatively, the input can be 66 MHz and all of its 
outputs can be at 66 MHz. It offers 0.5 ns output skew, 
and a low effective delay. In addition, VSL4485 can 


generate programmable, multiple phase relationships 
among its outputs. 


4.0 CLOCK GENERATION FOR THE 
Pentium™ PROCESSOR AND 
THE CPU-CACHE CHIP SET 


Clock generation is the generation of copies of clock 
signals from a signal oscillator or any other source 
which then are distributed to the various loads. The 
function of a clock driver is to generate multiple copies 
of clocks from a single source. In general, Pentium 
processor-based systems have three types of memory 
interface: fully synchronous, divided synchronous, and 
asynchronous. Each interface requires different meth- 
ods of clock generation. The basic setup of a processor 
card is illustrated symbolically in Figure 6. Depending 
on the configuration, the Clock In signal can come 
from the memory bus or a separate oscillator. 


Memory Bus 
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Figure 6. A CPU Module with the Pentium™ Processor, 82496 and 82491 CPU-Cache Chip Set 
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4.1 Clock Generation for Fully 
Synchronous Systems 


A fully synchronous system is one which everything in 
the system runs synchronous to the CPU. In particular, 
the memory bus interface is synchronous to the CPU. 
In Figure 6, the memory bus is at 66 MHz, synchro- 
nous to the CPU module. Clock In signal must be syn- 
chronous to the memory bus. Clocking for this case 
involves the generation of tightly controlled copies of 
clock signals that are distributed to all the clocked 
parts. The task of clock generation and distribution is 
the most difficult for this type of set up. All copies of 
clock signals must come from a single source, and must 
be deskewed appropriately. For Pentium processor- 
based systems that run at 66 MHz, the most critical 
parameter in choosing a clock driver is its output skew, 
as well as its part-to-part skew if more than one driver 
is needed. Since all the clock signals are at 66 MHz, 
only 1x outputs are needed. All of the drivers listed in 
Section 3.0 can be used here. 


For a fully synchronous configuration, it is likely that a 
single clock driver cannot provide enough copies of 
clock signals. Then, some kind of cascading of drivers is 
necessary. Figure 7 shows two ways of clock generation 
by cascading drivers. Tskew is the total worst case skew 
at outputs of CD2 and CD3. Tpp23 is the worst case 
part-to-part skew between CD2 and CD3. Tos?2 is the 
worst case output skew of CD2, assuming the worst 
case Output skew of CD3 is the same as Tos2. Tos! is 
the worst case output skew of CD1. Ttol2 is the feed- 
back tolerance of CD2. Feedback tolerance is the phase 
tolerance between the feedback input and the reference 
clock. Typically, Ttol2 is a small number. For the ex- 
amples in Figure 7, it is assumed that only the second 
level drivers feed the clock signals to the loads. Other- 
wise, for part a, signals from CD2 will be later than 
signals from CD1 by the propagational delay of CD2 
which is typically between 6 ns to 8 ns. 


For the examples in Figure 7 clock signals for the CPU- 
Cache chip set must be derived from one clock driver 
outputs only so that the 0.2 ns and 0.7 ns skew specifi- 
cations can be met. In part a, Tskew, the sum of Tpp23, 
Tos2, and Tosl is the worst case skew which is the 
skew between an output of CD2, and an output of 
CD3. The output skew of CD1 (Tos1) causes the inputs 
to CD2 and CD3 to arrive at different times. The differ- 
ence in propagational delay which is Tpp23, further 
skews the outputs of CD2 and CD3. If the part-to-part 
skew does not include output skew, different outputs 
from CD2 and CD3 can also be skewed by the output 
skew. For part b, Tskew, the sum of Ttol2, Tos2, and 
Tos1, is also the worst case skew between the outputs of 
CD2 and the outputs of CD3. Once again, Tos! causes 
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the inputs to CD2 and CD3 to arrive at different times. 
The feedback in CD2 synchronizes all its outputs in the 
input. The feedback output of CD2 is different from the 
input reference clock only by Ttol2. All the other out- 
puts are further skewed from the feedback output by 
Tos2. The analysis for CD3 is the same. 


PLL or 
Buffer 


CD 
a. Tskew = Tpp23 + Tos2 + Tosl 


CD2 


PLL or 
Buffer 


CD3 


b. Tskew = Ttol? + Tos? + Tosl 
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Figure 7. Examples of Clock Generation 


4.2 Clock Generation for Divided 
Synchronous Systems 


For a divided synchronous system, the memory bus is 
at half the speed of the CPU-Cache chip set; i.e., 
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the memory bus runs at 33 MHz for the Pentium proc- 
essor or the CPU-Cache chip set based systems. A 
33 MHz reference clock (Clock In) can come from the 
backplane from which all the clocks serving the CPU- 
cache module (Figure 6) must be synchronized. The 
memory bus controller (MBC) itself requires both 
33 MHz and 66 MHz clocks. For this configuration, 
clock drivers that can provide both 33 MHz and 
66 MHz outputs are needed. 


There are several ways of providing the two frequen- 
cies. They are shown in Figure 8 through Figure 12. 
Tskew is the worst skew between 33 MHz signals and 


Feedback for PLL locking 


Clk doubler 


NOTE: 
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66 MHz signals. The skews among 66 MHz signals or 
among 33 MHz signals are simply the output skew of 
the driving devices. Ttolpll is the PLL CLK doubler or 
PLL CLK divider’s feedback tolerance. Tospll is the 
PLL CLK doubler or PLL CLK divider’s worst case 
output skew. Tppbuf is the worst case part-to-part skew 
of the second level buffers. Those buffers can be phase- 
lock-loops also in which case Tppbuf is the feedback 
tolerance of the PLLs if feedback is used. Tosbuf is the 
worst case output skew of the second level buffers. Tos1 
is the output skew of CD1, Ttol2 is the feedback toler- 
ance of CD2, and Tos2 is the output skew of CD2. 


~ 
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Tskew (between 33 and 66) = Ttolpll + Tospll + Tppbuf + Tosbuf 


Figure 8. Clock Generation Using Clock Doubler 
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Figure 9. Clock Generation Using Clock Doubler 
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NOTE: 
Tskew (between 33 and 66) = Tospll + Tppbuf + Tosbuf 


Figure 10. Clock Generation Using Clock Divider 
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NOTE: 
Tskew (between 33 and 66) ~ Tos1 + Ttol2 + Tos2 


Figure 11. Clock Generation Using Two PLLs 
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Figure 12. Clock Generation Using Two PLLs 


Since the outputs of the first level PLL CLK doublers 
and dividers go directly to inputs of another clock driv- 
er, the signal quality requirements of these outputs are 
not as stringent as if the outputs drive the loads of the 
Pentium processor and others. One of the functions of a 
clock driver is to buffer and clean up a clock signal in 
addition to generating multiple copies of the same. 
However, the output skew of the PLL used for the first 
level is very important. Depending if feedback is used, 
the feedback tolerance is of importance. When choosing 
a clock driver, also be sure that its maximum output 
frequency is greater than 66 MHz for 66 MHz outputs 
and 33 MHz for 33 MHz outputs. The parts listed in 
Table 4 and Table 5 are examples of devices that can be 
used as first level drivers as illustrated in Figure 8 
through Figure 12. 


Table 4. List of Clock Doubler Parts 


Manufacturer | Part # Driver | # of Outputs 
Type (per pkg) 
Motorola 88915 PLL 1 @ 2x 
6 @ 1x (1) 
1 @ 0.5x 
88916 PLL 1 @ 2x 
4 @ 1x (1) 
1 @ 0.5x 
ae 
Vitesse VSL4485 | PLL 6 @ 1x 
2 @ 1, 2, or 4x 


NOTES: 

1. One of the outputs is inverted. 

2. This list is not meant to be complete. Other solutions 
may be available. 


Motorola 
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The phase-lock-loop drivers listed in Table 4 can be 
used to drive the Pentium processor loads directly if 
only one copy of 66 MHz clock signal is needed. In this 
case, the second level buffers are not necessary if the 
driver used can provide enough 33 MHz copies. Intel 
has not done formal analysis on these parts. 


Table 5. List of Clock Divider Parts 


Driver| # of Outputs 
Manufacturer Part # 
| Manutacurr | part |ret (per pkg) 


88915 | PLL 1 @ 2x 
88916 | PLL 


6 @ 1x (1) 
Texas Instruments | ABT338 | PLL 


1 @0.5x 
1 @ 2x 

4 @ 1x (1) 
1 @ 0.5x 


Texas Instruments | ABT337 | Buffer 


Texas Instruments | ABT339 | Buffer 


TriQuint GA1086 | PLL 9 @ 1x 
1 @ 0.5x 


NOTES: 

1. One of the outputs is inverted. 

2. This list is not meant to be complete. Other solutions 
may be available. 
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Table 5 lists examples of clock drivers that offer divided 
by 2 outputs. These devices can be used as the first level 
drivers illustrated in Figure 8 through Figure 12. De- 
pending on the number of 66 MHz copies and 33 MHz 
copies needed, the second level buffers may not be nec- 


essary. Again, Intel has not performed any formal anal- 
ysis on these parts. 


4.3 Clock Generation for 
Asynchronous Systems 


If the memory bus is not synchronized with the CPU or 
CPU-cache module, clock generation for the system is 
easier compared with the two configurations above. 
However, clock synchronization for the Pentium proc- 
essor, 82496, and 82491, as well as the clocks for the 
MBC is still a concern. In order for the MBC to com- 
municate properly with the CPU-Cache chip set, some 
synchronized clocks at 66 MHz are needed. Since the 
system is asynchronous to the CPU-Cache chip set, the 
number of synchronous MBC clock signals is less than 
the synchronous case. The examples in Section 5.0 il- 
lustrate how the synchronization is done. Since the sys- 
tem is asynchronous, one can use a different clock 
source for the CPU-Cache chip set from the rest of the 
system. 


5.0 Pentium™ PROCESSOR WITH 
256K 82496/82491 SECOND 
LEVEL CACHE CLOCK 
DISTRIBUTION DESIGN 
EXAMPLES 


After a clock generation scheme is determined, careful 
analysis must be done on clock distribution to ensure 
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minimal skew and proper rise and fall times. Clock dis- 
tribution is the connection between clock driver outputs 
and clock inputs of the components that need clocking. 
Preliminary analysis has been done on several of the 
drivers listed in Section 3.0. The following examples 
show in detail how to terminate transmission lines 
properly, tune clock traces to minimize board trace 
skew, and validate the usefulness of the drivers to the 
CPU-Cache chip set using models from the manufac- 
turers. The examples have been done using preliminary 
or typical models for the devices involved. They are 
meant as an example of the process designers can use 
when selecting and routing a clock circuit. Although 
the examples only show the clock distribution for the 
CPU-Cache chip set, the same principles can be applied 
to distribution to the memory bus controller (MBC) 
and other parts. 


5.1 Clock Routing for the 256K CPU- 
Cache Chip Set 


Analysis for CPU-Cache chip set clock routing is done 
using first order input models for the three chips, 
shown in Figure 13. Specific to CLK inputs, the models 
shown are typical models based on on-going simulation 
efforts, and are subject to change. Refer to the Intel 
data books or contact Intel for the latest models (in- 
cluding minimum and maximum conditions). The mod- 
els include package inductance, package capacitance, 
and input buffer capacitance of the clock pins. 
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Figure 13. Pentium™ Processor, 82496 and 82491 Clock Input Models 


3-150 PRELIMINARY : 


8249 1#9 


O 


82491#10 


O 


°o Via 


© 82491 CLK input 


; % wees PD SE! eS + F = C Fee 2 * Sia. Aw, ~ 7 rT eet. 4 a SF CF eee Se 
meee — y: See 3; : i oe TS ect i ee . os ety AS 


Oty 
Pentium 
Processor 


O Pentium Processor, 82496 CLK input 


R; Series Termination Resistor 


Trace lengths (in mils): 
O0—83 (top) 

1—201.5 (internal 2) 
2—1608.4 (internal 2) 
3—6578.7 (internal 3) 
4—7250.9 (internal 1) 
5—805.7 (internal 4) 
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Figure 14. CLKO Layout for 256K Chip Set with Parity 
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Ry Series Termination Resistor 
Trace lengths (in mils): 
O0—78 (top) 
1—455.3 (internal 2) 
2—3639.3 (internal 1) 
3—2180.7 (internal 4) 
4—530.7 (internal 4) 
5—75 (top) 
6—727.2 (internal 4) 
7—75 (top) 

8—462.4 (internal 2) 
9—4650 (internal 1) 
10—1156.7 (internal 4) 
11—526.5 (internal 4) 

12—75 (top) 
13—520.7 (internal 4) 
14—75 (top) 


Figure 15. CLK1 Layout for 256K Chip Set with Parity 
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R; Series Termination Resistor 
Trace lengths (in mils): 
0—78 (top) 
1—206.5 (internal 2) 
2—3480.5 (internal 1) 
3—2550.9 (internal 4) 
4—560.2 (internal 4) 
5—75 (top) 
6—527.2 (internal 4) 
7—75 (top) 
8—160 (internal 2) 
9—1236.0 (internal 2) 
10—4290.9 (internal 3) 
11—631.0 (internal 1) 
12—530.7 (internal 4) 
13—75 (top) 
14—534.3 (internal 4) 
15—75 (top) 


Figure 16. CLK2 Layout for 256K Chip Set with Parity 
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R; Series Termination Resistor 

Trace lengths (in mils): 

0—75 (top) 

1—562.4 (internal 2) 

2—5897.2 (internal 3); 7147.2 for National 74CT2527 as clock driver 
3—631.0 (internal 4) 

4—526.5 (internal 4) 

5—75 (top) 

6—520.7 (internal 4) 

7—75 (top) 


Figure 17. CLK3 Layout for 256K Chip Set with Parity 
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Figure 14 through Figure 17 show how clock signals 
are distributed from the clock driver to each compo- 
nent in the CPU-Cache chip set. Each clock line is 
tuned to minimize skew. Series termination is used on 
each line. Since the 82491, numbers 9 and 10, are parity 
chips, they are grouped onto the same driver output. 
Since the skew requirement between the Pentium proc- 
essor and the 82496 is very tight (0.2 ns), they are also 
on the same driver output. All the loads on the same 
driver output can be tuned to have close to zero skew. 
Loads on different outputs, however, must contend 
with the output skew of the driver. CLKO through 
CLK2 are layed out such that they branch off very 
close to the driver. For these lines, the transmission line 
characteristic impedance that the clock driver output 
sees can be treated as two resistors in parallel. In other 
words, the driver output sees half the impedance for 
CLKO, CLK1, and CLK2. The advantage of this 
scheme is that the value of the termination resistor is 
reduced dramatically. A smaller termination resistor 
helps faster rise and fall times. For CLK3, the branch 
off is at the end of the line, and thus the driver output 
sees the full characteristic impedance. 


To achieve minimal skew, all the loads should be bal- 
anced, i.e., all the loads should be the same. For this 
design example, CLK 1 and CLK2 have about twice the 
loads of CLKO and CLK3. The load imbalance will add 
to the output skew quoted by manufacturers since on 
data sheets, manufacturers generally quote output skew 
for balanced loads. Since heavier loads see the transmit- 
ted signal later than lighter loads assuming the trans- 
mission lines are of the same length, traces for the light- 
er loads can be made longer to compensate for the dis- 
crepancy. CLKO and CLK3 have longer traces from 
driver output to load than CLK1 and CLK2 traces. 
Since heavier loads (higher capacitance) have a longer 
rise time, and since for the CPU-Cache chip set skew 
measurements are taken at 0.8V, 1.5V, and 2.0V, to 
minimize skew at all free points, the termination resis- 
tors for CLK1 and CLK2 should be smaller than the 
termination resistors on CLKO. However, a smaller ter- 
mination resistor than the value needed to perfectly ter- 
minate the line will result in a larger undershoot. When 
choosing termination values, it is a trade off among 
rise/fall times, skew, and undershoot. 


When choosing a termination value, it is important to 
know the output impedance of the driver. For many 
TTL drivers, output rise impedance is different from 
output fall impedance. [Refrence 3, Section 9] shows 
how to measure output impedance, or the driver manu- 
facturer can be contacted for the information. Typical- 
ly, output fall impedance is 50-10, and rise imped- 
ance is 51-502. 
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Figure 14 through Figure 17 are extensions to the lay- 
out topologies in the Pentium™ Processor, 82496, and 
8249] 256K CPU-Cache Chip Set Layout Example. The 
routing topologies 15—18 shown in the example route 
the clock signals from the Pentium processor, 82496, 
and all the 82491’s to the outside. The layout examples 
shown in this application note (Figure 14 through Fig- 
ure 17) take the layout all the way to the clock driver, 
complete with termination. Although in the example, 
clock signals are routed toward the bottom and in the 
examples here, the clock signals are routed toward the 
side, the same principles apply. If routing toward the 
bottom is preferred, the same layouts as illustrated in 
Figure 14 through Figure 17 can be used with little or 
no modification. 


5.2 Analysis of Drivers Used in 
Examples 


Output _models for MC10H646, CGS74CT2527, 
GA1086, and VSL4485 are used to drive the clock net- 
work described in Section 5.1. The clock networks 
shown in Figure 14 through Figure 17 were used. The 
simulations assume no variation in characteristic im- 
pedance and propagation speed for the board traces. 
Fast and slow simulations were performed. Three sig- 
ma clock driver models are used when available. Board 
traces are assumed to have plus/minus 10% variation 
in characteristic impedance and propagational speed. 
Table 6 shows the range of trace characteristics. .Slow 
simulations assume the highest operating temperature 
the drivers expect to see, and slow interconnect charac- 
teristics. Fast simulations assume operating tempera- 
ture to be zero and fast interconnect characteristics. 


Table 6. Interconnect Characteristics 


Trace Z0(1) TD(2) 
a (Q) (ns/ft) 


ev 
[Surece | 72 


NOTES: 
1. Characteristic Impedance 
2. Propagational Speed 


Since simulation can only account for skew due to 
board trace and load imbalance, total skew is assumed 
to be the sum of the worst case output skew published 
by driver manufacturers and skew from simulation. 
Skew from simulation is derived by using identical driv- 
er models for each driver output, thus assuming zero 


3-155 


= mr 


~ AP-479 


output skew. The board traces and termination resist- 
ances are tuned with 0.5 ns output skew in mind which 
leaves 0.2 ns for trace skew. For TriQuint’s GA1086, 
the output skew is 0.25 ns; thus, there is a larger win- 
dow for trace skew. Table 7 summarizes simulation re- 
sults of the tightest parameters for each driver. All of 
the drivers can meet the 4 ns minimum high and low 
times easily. Most clock drivers guarantee a 45/55 duty 
cycle, which exceeds Intel’s requirement. 


Series termination resistors are chosen to minimize 
skew and undershoot. To achieve similar rise time for 
each load, termination resistance values are smaller for 
heavier loaded lines such as CLK1 and CLK2 com- 
pared to the resistance values for CLKO. Since CLK3 
splits off at the end of the line, its termination value is 
about twice as CLKO’s. Table 8 lists the termination 
values for each line and for each driver. Waveforms for 
each driver are attached. Notice the signals at the CLK 


te 
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input for each load is relatively clean whereas the sig- 
nals at the driver side are not. Since the clock signals 
are only important to the component receiving the sig- 
nal, how dirty the signal is at the driver end is not 


important, providing that the signal does not cause any 
damage or other ill effects on the driver. 


Figures attached in this section show some waveforms 
from the simulations. V(201) is the voltage at the Penti- 
um processor clock input, V(202) is the voltage at the 
82496 clock input, and. V(213), V(214), V(217), and 
V(218) are voltages at the 82491 clock inputs for the 
82491s on CLK1 line. For 74CT2527, V(8) is the volt- 
age at driver output, V(100) is the voltage at the junc- 


tion of the series termination resistor and the beginning 


of board trace. For 10H646, V(9) is the voltage at driv- 
er output, V(20) is the voltage at the junction of the 
series termination resistor and the beginning of board 
trace. 


Table 7. Compilation of Simulation Data 


Worst Skew 
P5-C5C 
pra mesa 


Clock 
Driver 


NOTES: 

1. All Skews are worst case numbers 

2. Not using the parity chips 

3. Worst Undershoot of all the CLK nodes 

4. Slowest rise and fall times of all the CLK nodes 


Worst Skew 
C8C-Others 


we ee rR oe eo So a 


Motorola} 10H646 St 0.67 |0 0.67 468 |816 | 1600 |0.90/ aa 
1.13 
tee 42 


— — 0.45|0.7 |0.45 07 a 1600 |0.9/ sa 1 is 
18 othe: £15 
(6) 
a 


Worst Skew 
C8C (No Parity) 
(ns) (2) 


Undershoot 
(—mV)(3) 


al, 
on 
~ 


wal ie rT Be a a 


5. Only typical model at 25°C is available. Thus, only simulation performed is with slow interconnect corner 
6. Simulation done on driver slow corner. Device specification for ts is 1.4 ns worst case. Device was still under development 
when simulation was done. Please contact TriQuint for more information. 
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Clock distribution method for the memory bus control- 
ler (MBC) is very similar to that of the chip set. When 
distributing clocks for the MBC, be sure to load each 
driver output with similar loads as for the chip set, and 
route clock traces with similar lengths as for the chip 
set. For example, CLK1 and CLK2 have an aggregate 
load of about 20 pF, and the total clock trace length is 
about 7” from driver output to a load. To minimize the 
clock skew of the MBC clock from loads on CLK1 and 
CLK2 lines, the clock lines should fan out 2.9 pF per 
inch. Also, be sure to terminate the line properly. It is 
important to keep the loading similar to the loadings on 
clock lines of the chip set if skew is to be kept close to 
0.7 ns. Adjusting trace lengths and termination resist- 
ance can compensate for load imbalance to a degree, 
but not perfectly and not always. 


Simulations results provided here are based on best 
available models at the time. Some models were for 
parts still under development at the time of simulation. 
Therefore, the simulation results are subject to change. 
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Table 8. Series Termination 
Resistor Values for Each Line 


10H646 
eee 
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National 2527 CLK simulation, CLKO slow 
Date/Time run: 03/90/92 20:28:29 
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Figure 20. Vitesse (Slow) Waveform 
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Figure 21. Vitesse (Slow) Waveform (Continued) 
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Figure 22. Vitesse (Fast) Waveform 
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Figure 23. Triquint Waveform 
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Figure 24. Triquint Waveform (Continued) 
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6.0 Pentium™ PROCESSOR WITH 
512K 82496/82491 SECOND 
LEVEL CACHE CLOCK 
DISTRIBUTION ISSUES 


Clock distribution for 512K CPU-Cache chip set can be 
done in the same way as for the 256K chip set. Since 
there are more SRAM chips for the 512K cache, there 
are more loads that need clocking. Including parity, 
there are 18 82491s. Once again, the same principles 
apply. Keep the driver loading as close to balanced as 
possible. Tune traces and adjust termination resistance 
so that skew is minimized. 


7.0 CLOCK DISTRIBUTION FOR THE 
Pentium™ PROCESSOR WITH 
OTHER SECOND LEVEL 
CACHES 


The Pentium processor can be used with cache configu- 
rations other than with the 82496 and 82491, as well as, 
without a second level cache. With other caches, the 
first thing that must be done is to decide how much 
skew is tolerable. Then, decide on which clock driver to 
use and carefully layout clock signals for distribution. 
If skew requirements do not exceed the CPU-Cache 
chip set requirements, the same drivers and the same 
distribution can be used. Design examples in Section 
5.0 serve as a guide to how to distribute clocks for Pen- 
tium processor systems with tight skew. 


If the Pentium processor is used without a second level 
cache, and only a small number of 66 MHz signals are 
needed, there are a few more options for clock drivers. 
For example, Motorola’s 88915 has one 2x output that 
can run to maximum 70 MHz. Texas Instruments has 
the ABT337, 338, and 339 that can provide four copies 
of 66 MHz signals. 


8.0 SUMMARY 


At high speeds, clock synchronization becomes a diffi- 
cult problem. Clock traces must be treated as transmis- 
sion lines. Proper termination must be given to the lines 
to ensure good signal quality. The Pentium processor, 
with operating frequencies of 60 MHz and 66 MHz, has 
tight clock requirements. Together with the 82496 
Cache Controller and 82491 Cache SRAM, the CPU- 
Cache chip set must be synchronized with minimal 
skew. 


For the Pentium processor clocking, the most critical 
parameters are skew and rise and fall times. Depending 
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on the memory interface to the CPU-Cache chip set, 
there are many ways of generating multiple copies of 
clock signals. 


Fully synchronous designs need to route 66 MHz only, 
but with minimal skew for all of them. Divided syn- 
chronous designs require both 66 MHz and 33 MHz 
signals. Asynchronous designs need to worry about the 
CPU-Cache chip set clock generation and distribution 
as well as the MBC. 


Several clock drivers have been analyzed in detail with 
carefully tuned clock routing and the proper termina- 
tion such that the clock signals transmitted to the Pen- 
tium processor, 82496, and 82491 meet all the timing 
requirements of the Intel chip set parts. Loading on a 
clock driver should be as balanced as possible. Clock 
traces should have equivalent length from driver output 
to load. The clock lines should be terminated properly 
to minimize reflections. 


The same design principles used in the 256K CPU- 
Cache chip set clocking example can be applied to oth- 
er CPU-cache configurations, or to a cacheless inter- 
face. 


This application note has listed a number of devices 
from several different manufacturers. The purpose of . 
this list is to supply a starting point for finding a clock- 
ing solution that meets each system’s specific require- 
ments. The lists provided are not meant as an endorse- 
ment or guarantee of the devices listed. In addition, 
these lists are not a complete listing of devices. These or 
other manufacturers may offer additional devices that 
meet the clock specifications for the Pentium processor. 
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APPENDIX A 
CLOCK DRIVER MANUFACTURERS 


The following is a list of contacts for the clock driver 
manufacturers listed in this application note. It is not 
meant to be an exhaustive list of all possible solutions. 
It is meant as a starting point for system designers to 
assist in finding a clock solution that meets their system 
requirements. 


AMCC 


United States: 

Headquarters 

6195 Lusk Boulevard 

San Diego, CA 92121-2793 

Ph: 619-450-9333 or 800-PLL-AMCC (755-2622) 
FAX: 619-450-9885 


Europe: 


Amega Electronics 
Basingstoke, RG24OPF, U.K. 
Ph: 011/44-256-843 166 


Japan: 


Teksel Co., Ltd. 
Kawasaki 213 

Tokyo, Japan 

Ph: 011/81-448 127430 


Israel: 


EIM 
Petach Tiqva, Israel 
Ph: 011/972-3-9233257 


AT&T Microelectronics 


AT&T Customer Response Center 
Ph: 800-372-2447 x773 


Danny George 

555 Union Blvd. 
Allentown, PA 18103 — 
50N2G2100 

Ph: 215-439-6697 


Cypress 


Sean Dingman 

3901 N. Ist St. 

San Jose, CA 95134 
408-943-2743 
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ICS 


Bruce Rogers 

Technical Marketing Manager 
2626 Van Buren Ave. 

P.O. Box 968 

Valley Forge, PA 19482 
215-666-1900 


Intel PLD BU International Contact List 
United States: 


John Van Sack 

Intel Corporation 
FM4-42 

1900 Prairie City Road 
Folsom, CA 95630 

Ph: (916) 356-3964 
FAX: (916) 356-6949 


Europe: 


Tony O’Sullivan 

Intel Corporation GmbH 
Dornacher Str. 1 

PostFach 213 

D-8016 FeldKirchen/Munchen 
Germany 

Ph: (49) 89/90992-340 

FAX: (49)89/9043948 


Japan: 


Norikazu Aoki 

5-6 Tokodia, Tsukuba-shi 
Ibaraki-Ken 300-26 
Japan 

Ph: 0298-47-0721 

FAX: 0298-47-8819 


APAC: 


Eric Chan 

Intel Technology SDN BHD 
Bayan Lepas Free Trade Zone, 
Box 121 

11900 Penang 

Malaysia 

Ph: 604-820-7271 

FAX: 604-836-405 
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Motorola Inc. Texas Instruments 
Todd Pearson United States: 
Motorola Inc 
2200 W. Broadway Rd. Steve Plote 
Mesa, Arizona 85202 Program Manager 
USA CLOCK DRIVERS 
Ph: (602) 962-3410 8330 LBJ Freeway, Center 3 
P.O. Box 655303 
Masanori Matsubara Dallas, Texas 75265 
Nippon Motorola LTD Ph: 214-997-5214 
3-20-1, Minami-Azabu 
-Minato Ku, Tokyo 106 Brett Clark 
Japan Applications Engineer 
~ Ph: 81-33-280-8383 Ph: 903-868-5836 
Axel Krepil Japan: 
Motorola GMBH 
Schatzbogen 7 Mich Komatsu 
8000 Munchen 81 Texas Instruments Japan LTD. 
Germany M.S. Shibaura Bldg. 13-23 
Ph: 49-89-92 103-167 Shibaura 4-Chome 
Minato-ku Tokyo, 108 Japan 
Derek Leung Ph: 033-769-8717 
Motorola Hong Kong LTD 
Silicon Harbour Center Asia Pacific Region: 
2 Dai King Street 
Taipo Industrial Estate Eric Wey 
Taipo N. T. Hong Kong Texas Instruments Taiwan LTD. 
Ph: 852-666-8194 Taipei Branch 
10F Bank Tower, 205 Tung Hua N. 
National Semiconductor Taipei, Tailwan ROC 


Ph: 886-2-713-9311 
National Semiconductor 


Santa Clara, CA Europe: 

Tony Ochoa 

Ph: 408-721-6804 Lothar Katz 

Ph: 800-272-9959 Texas Instruments 

) 8050 Freising, Fed. Rep. of Ger. 

Pioneer Semiconductor Deutschland GMBH 
Haggertystr. 1 

Joe Kraus Ph: 49-816-80314 

2343 Bering Dr. 

San Jose, CA 95131 TriQuint Semiconductor 

Ph: 408-435-0800 

FAX: 408-435-1100 United States: 


Marketing, Sunil Sanghavi 

(408) 982-0900 x142, FAX (408) 982-0222 
Western Sales, Mark Wu 

(408) 982-0900 x113, FAX (408) 982-0222 
Central Sales, John Watson 

(214) 422-2532, FAX (214) 423-4947 

East Sales, Mike Zyla 

(215) 493-6944, FAX (215) 493-7418 
International, Mike Kilgore 

(503) 644-3535 x228, FAX (503) 644-3198 
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Europe - GiGA A/S 


Fin Helmer, President 
45-4-343-1588, FAX 45-4-343-5967 


Japan - Japan Macnica Corp. 


Shin Ishikawa, Product Manager 
045-939-9140, FAX 045-939-6141 


Vitesse Semiconductor 


United States: 


Corporate Headquarters 

Vitesse Semiconductor Corporation 
741 Calle Plano 

Camarillo, CA 93012 

Ph: 805-388-7501 

FAX: 805-388-7565 


Europe: 


Thomson Composants Micronodes 
Route Departementale 128 B.P. 48 
91401 Orsay Cedex France 

Ph: 33-1-60-19-7000 

FAX: 33-1-60-19-7140 
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Japan: 


H.Y. Associates Co., LTD. 


’ 3-1-10, Sekimachikita, Nerima-Ku 


Tokyo, 177 Japan 
Ph: 81-33-929-7111 
FAX: 81-33-928-0301 


Korea: 


Beaver International, Inc. 
3601 Deauville Court 
Calabasas, CA 91302 

Ph: 818-591-0356 

FAX: 818-591-0753 


Taiwan: 


Tamarack Microelectronics 

16 Fl., No. 1, Fu-Hsing N. Road 
Taipei, Taiwan, ROC } 

Ph: 886-2-772-7400 

FAX: 886-2-776-0545 
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1.0 INTRODUCTION 


In a system environment, the Pentium™ processor’s 
temperature is a function of both the system and com- 
ponent thermal characteristics. The system level ther- 
mal constraints imposed on the package are local ambi- 
ent temperature and thermal conductivity (i.e., airflow 
over the device). The Pentium processor thermal char- 
acteristics depend on the package (size and material), 
the type of interconnection to the printed circuit board 
(PCB), the presence of a heat sink, and the thermal 
conductivity and the power density of the PCB. 


All of these parameters are aggravated by the continued 
push of technology to increase the operating speeds and 
the packaging density. As operating frequencies in- 
crease and packaging size decreases the power density 
increases and the heat sink size and airflow become 
more constrained. The result is an increased impor- 
tance on system design to ensure that thermal design 
requirements are met for each component in the sys- 
tem. 


In addition to heat sinks and fans, there are other solu- 
tions for cooling integrated circuit devices. A few of 
these solutions are: fan mounted on heat sink, heat 
pipes, thermoelectric (peltier) cooling, liquid cooling, 
etc. While these alternatives are capable of dissipating 
additional heat, they have disadvantages in terms of 
system cost, complexity, reliability, and efficiency. 
These techniques are more expensive than a passive 
heat sink and fan. The introduction of active devices 
can also decrease reliability. Finally, the power efficien- 
cy of some of these techniques is poor, and gets worse 
as the amount of power being dissipated increases. De- 
spite these disadvantages, each of these solutions may 
be the right one for particular system implementations. 


However, for the purpose of this application note, Intel 


has focused its efforts on describing solutions using pas- 
sive heat sinks and fans. 
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1.1 Document Goal 


The goal of this document is to provide thermal per- 
formance information for the Pentium processor and 
recommendations for meeting the thermal requirements 
imposed on systems. This application note attempts to 
provide an understanding of the thermal characteristics 
of the Pentium processor and some examples of how 
the thermal requirements can be met. 


2.0 IMPORTANCE OF THERMAL 
MANAGEMENT 


Thermal management of an electronic system encom- 
passes al! of the thermal processes and technologies that 
must be employed to remove and transfer heat from 
individual components to the system’s thermal sink in a 
controlled manner. 


The objective of thermal management is to ensure that 
the temperature of all components is maintained within 
functional and absolute maximum limits. The function- 
al temperature limit is the range within which the elec- 
trical circuits can be expected to meet their specified 
performance requirements. Operation outside the func- 
tional limit can degrade system performance or cause 
logic errors. The absolute maximum temperature limit 
is the highest temperature that a portion of the compo- 
nent may be safely exposed. Temperatures exceeding 
the limit can cause physical destruction or may result in 
irreversible changes in operating characteristics. Higher 
temperatures result in earlier failure of the devices in 
the system. Every 10°C rise above the operating range 
means a halving of the mean time between failures. 
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3.0 Pentium™ PROCESSOR POWER 
SPECIFICATIONS 


The Pentium processor’s power dissipation and case 
temperature specs for 60 MHz and 66 MHz are shown 
in Table 1. 


To ensure functionality and reliability of the Pentium 
processor, maximum device junction temperature must 
remain below 90°C. Considering the power dissipation 
levels and typical ambient environments of 40°C to 
45°C, the Pentium processor’s junction temperatures 
cannot be maintained below 90°C without additional 
thermal enhancement to dissipate the heat generated by 
this level of power consumption. 


The thermal characterization data described in Table 2 
illustrates that both a heat sink and airflow are needed. 
The size of heat sink and the amount of airflow are 
interrelated and can be traded off against each other. 
For example, an increase in heat sink size decreases the 
amount of airflow required. In a typical system, heat 
sink size is limited by board layout, spacing, and com- 
ponent placement. Airflow is limited by the size and 
number of fans along with their placement in relation 
to the components and the airflow channels. In addi- 
tion, acoustic noise constraints may limit the size or 
types of fans limiting the airflow. 


To develop a reliable thermal solution, all of the above 
variables must be considered. Thermal characterization 
and simulation should be carried out at the entire sys- 
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tem level accounting for the thermal requirements of 
each component. 


4.0 THERMAL PARAMETERS 


Component power dissipation results in a rise in tem- 
perature relative to the temperature of a reference 
point. The amount of rise in temperature depends on 
the net thermal resistance between the junction and the 
reference point. Thermal resistance is the key factor in 
determining the power handling capability of any elec- 
tronic package. 


Thermal resistance from junction to case (@jc), and 
from junction to ambient (@y,) are the two most often 
specified thermal parameters for integrated circuit 
packages. 


4.1 Ambient Temperature 


Ambient temperature is the temperature of the undis- 
tributed ambient air surrounding the package. Denoted 
Ta, ambient temperature is usually measured at a spec- 
ified distance away from the package. In the laboratory 
test environment, ambient temperature is measured 12 
inches upstream from the package under investigation. 
In a system environment, ambient temperature is the 
temperature of the air upstream to the package and in 
its close vicinity. 


Table 1. Pentium™ Processor Power Dissipation 
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Package | Total Pin Package Power | Power | Max Case 

Type Pins | Array Size (Typical) | (Max) | Temp (°C) 
Pentium Processor 60 MHz 273 | 24x21 | 2.16", %2.16" 11.9W 14.6W ea aor S 
Pentium Processor 66 MHZ PGA 21x 21 2.16" x 2.16" | 


130 
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4.2 Case Temperature 


Case temperature, denoted Tc, is measured at the cen- 
ter of the top surface (on top of the heat spreader, see 
Figure 1) of the package, typically the hottest point on 
the package case. Special care is required when measur- 
ing the case temperature to ensure an accurate tempera- 
ture measurement. Thermocouples are often used to 
measure Tc. Before any temperature measurements, 
the thermocouples have to be calibrated. When measur- 
ing the temperature of a surface which is at a different 
temperature from the surrounding ambient air, errors 
could be introduced in the measurements. The mea- 
surement errors could be due to having a poor thermal 
contact between the thermocouple junction and the sur- 
face, heat loss by radiation or by conduction through 
thermocouple leads. To minimize the measurement er- 
rors, it is recommended to use the following approach: 


© Use 36 gauge or finer diameter K, T, or J type ther- 
mocouples. The laboratory testing was done using a 
thermocouple made by Omega (part number: 
STC-TTK-36-36). 


e Attach the thermocouple bead or junction to the 
center of the package top surface using high thermal 
conductivity cements. The laboratory testing was 
done by using Omega Bond (part number: OB-100). 


©@ The thermocouple should be attached at a 90° angle 
as shown in Figure 1. When a heat sink is attached a 
hole (no larger than 0.15”) should be drilled 
through the heat sink to allow probing the center of 
the package as shown in Figure 1. 
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e If the case temperature is measured with a heat sink 
attached to the package, drill a hole through the 
heat sink to route the thermocouple wire out. 


4.3 Junction Temperature 


Junction temperature, denoted Ty, is the average tem- 
perature of the die within the package. 


The junction temperature for a given junction-to-ambi- 
ent thermal resistance, power dissipation, and ambient 
temperature is given by the following formula: 


Ty = Pp * Oya + Ta 

If a heat sink with thermal resistance of Osa (sink-to- 
ambient) is used, then the thermal resistance from the 
junction-to-case, yc, is given by the following formula: 
Ty. = Pp * (80sec + Pes + Oga) + Ta 


where: 


O@cs is the thermal resistance from the component 
(case) to the heat sink. 
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Figure 1. Thermocouple Attachment 
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4.4 Thermal Resistance 


Thermal resistance (Figure 2) values for junction-to- 
ambient, 6yq4 and junction-to-case, 9jc, are used as 
measures of IC package thermal performance. @jc¢ is a 
measure of the package’s internal thermal resistance 
along the major heat flow path from silicon die to pack- 
age exterior. This value is strongly dependent on the 
material, thermal conductivity, and geometry of the 
package. 0yc values also depend on the location of the 
reference point (in this case center of the package top 
surface), the external cooling configurations. and the 
heat flow paths from the package to the ambient. For 
example, if a heat sink is attached to the package top 
surface or more heat is pulled into the board through 
the pins, the 6jc values measured with reference to the 
center of the package top surface will change. Oy, val- 
ues include not only internal thermal resistance, but 
also the radiative and convective thermal resistance 
from the package exterior to ambient air. 0y,4 values 
depend on the material, thermal conductivity, and ge- 
ometry of the package and also on ambient conditions 
such as airflow rates and coolant physical properties. 


In order to obtain thermal resistance values, junction 
temperature is measured using the temperature sensi- 
tive parameter (TSP) method. With this method, spe- 
cial design thermal test structures are used which are 
approximately the same size as the Pentium processor 
die. The test structure consists of resistors and diodes. 


Heat Sprea 


i: 

intel : 
Resistors are used to simulate the Pentium processor 
power dissipation and thereby heat up the package. Di- 
odes, which are located at the center of the thermal test 
die, are used to measure the die temperature. The mea- 
surements are carried out in a wind tunnel environ- 
ment. The air flow rate and the ambient temperature 


are measured 12 inches away from the package in the 
upstream air. 


The parameters are defined by the following relation- 
ships: 

4 = (Ty — Ta/Pp 

Ore, = (Ty — TeyPp 

934 = 93c + Oca 


where: 
634 = junction-to-ambient thermal resistance 
°C/W) 


0;c = junction-to-case thermal resistance °C/W) 


9cA = case-to-ambient thermal resistance (°C/W) 
Ty = average die (junction) temperature (°C) 

Tc = case temperature at a pre-defined location (°C) 
Ta = ambient temperature (°C) 


Pp = device power dissipation (W) 


Ambient (Air) 


Case 


Junction (Die) 
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Figure 2. Thermal Resistance Parameters 
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Table 2 lists the junction-to-case and case-to-ambient thermal resistances for the Pentium processor (with and 


without a heat sink). 


Table 2. Thermal Characterization Data 
Oca **vs Airflow (ft/min.) 


NOTE: 


Heat Sink: 2.1 in2 base, omni-directional pin Al heat sink with 0.050 in. pin width, 0.143 in. pin-to-pin center spacing and 
0.150 in. base thickness. Heat sinks are attached to the package with a 2 to 4 mil thick layer of typical thermal grease. The 


thermal conductivity of this grease is about 1.2 w/m c. 


** @ca values shown in this table are typical values. The actual @cq values depend on the air flow in the system (which is 
typically unsteady, non-uniform and turbulent) and thermal interactions between Pentium CPU and surrounding components 


through PCB and the ambient. 


5.0 DESIGNING FOR THERMAL 
PERFORMANCE 


At this point the application note turns from describing 
the characteristics that define thermal performance to 
describing how designers should use these characteris- 
tics to assess thermal requirements of PC system de- 
signs. The Pentium processor specifies a maximum case 
temperature, Tc, of 70°C @ 66 MHz. This case temper- 
ature limit along with the Pentium processor’s power 
and thermal resistance characteristics can be used to 
determine the ambient temperature required to keep 
the Pentium processor operating within its specified 
limits. Using these parameters in the following equa- 
tions: 
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TA. = Jo.-— ecw 
Ta = 70°C —(16W* 10.5°C/W) 
Ta = —98°C 


The maximum ambient temperature required in a Pen- 
tium processor system without any additional thermal 
enhancement is — 98°C at 66 MHz. Obviously, this am- 
bient temperature is impractical and unachievable in a 
PC system. In order to be able to maintain the case 
temperature at 70°C in a typical system ambient with 
air temperature of 40°C to 45°C, the thermal resistance 
between the case and the ambient must be reduced. 


3-175 


AP-480 


5.1 Heat Sinks 


The most common way to improve the package thermal 
performance is to increase the surface area of the device 
by attaching a large piece of metal (a heat sink) to the 
package. The heat sink is usually made of Aluminum 
and is chosen for its price/thermal-performance ratio. 
There are materials that offer higher conductivity such 
as copper, but cost becomes prohibitive. To maximize 
the flow of heat for a given junction temperature rise 
over the ambient temperature, the thermal resistance 
from heat sink to air can be reduced by a) maximizing 
the surface area, and b) maximizing the air flow across 
the surface area (maximizing air flow through heat sink 
fins in most cases). 


Intel has used test data to determine what size of heat 
sink and airflow is needed to properly cool a Pentium 
processor system. The data was derived assuming an 
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adhesive attach process that offers thermal resistance of 
about 0.2°C/W. 


The testing was done in a wind tuunel in the configura- 
tion (in Figure 3) where the heat sink was mounted on 
a real Pentium processor package with a thermal die 
mounted inside to generate the 16W of power. The 
package is then mounted in a socket which is soldered 
to a 2-layer PCB that brings power to the die. 


Based on these tests, three specific heat sink and airflow 
combinations have been identified that properly dissi- 
pate the Pentium processor’s 16W and maintains a case 
temperature below 70°C @ 66 MHz. The three heat 
sinks are shown in Figure 4. 
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Figure 3. Improving Thermal Performance 


Heat sink A: H 1.4” for 200 LFMs of Air 
Heat sink B: H 1.05” for 300 LFMs of Air 
Heat sink C: H 0.85” for 400 LFMs of Air 


Assumption: Air Temp = 45°C 
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Figure 4. Recommended Combinations 
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In addition, testing has been done to provide more gen- 
eral guidelines which allow deviating from the above 
conditions. These guidelines allow systems to derive 
various combinations of heat sink size and airflow that 
ensure the Pentium processor thermal specifications are 
met. For example, by increasing the heat sink x-y di- 
mensions and extending it over the package footprint, 
the heat sink height can be reduced while maintaining 
the same thermal performance as the taller heat sink 
with the same footprint as that of the package. The first 
three charts (Figures 5, 6, 7) show the thermal resist- 
ance as a function of heat sink size and airflow. The last 
three charts (Figures 8, 9, 10) show the power dissipa- 
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tion achievable with a given heat sink size and airflow. 
The power dissipation calculations assume Tc = 70°C 
@ 66 MHz, Ta = 45°C, and @jc = 0.6°C/W. 


Pmax = (Tc — Ta)/O9cA = 25/0ca 


A key assumption in all of these calculations is that a 
perfect thermal connection can be achieved between the 
case and the heat sink. One ban extrapolate the heat 
sink solutions by adding the additional thermal resist- 
ance of any chosen heat sink attach process. See Ap- 
pendix B for case to heat sink thermal interface options. 


Heat Sink Size (Base) = 2.1 Inches Square 
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Figure 5. Thermal Resistance 


Heat Sink Size = 2.6 Inches Square 


Theta-CA (degrees C/W) 
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Figure 6. Thermal Resistance 
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Heat Sink Size (Base) = 3.0 Inches Square 


wm St + a & 


— 


= 
PA 
o 
3 
5 
fe, 
® 
= 


oO 


OA 06 O08 1 
Heat Sink Height (inches) 


241575-7 


Figure 7. Thermal Resistance 


Heat Sink Size (Base) = 2.1 Inches Square 
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Figure 8. Power Dissipation 
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Heat Sink Size (Base) = 2.6 Inches Square 


O50: 2 1200 
Heat sink Height (inches) 
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Figure 9. Power Dissipation 


Heat Sink Size (Base) = 3.0 Inches Square 
Airflow 
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Figure 10. Power Dissipation 
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5.2 Airflow 


To improve the effectiveness of heat sinks it is impor- 
tant to manage the airflow so as to maximize the 
amount of air that flows over the device or heat sink’s 
surface area. In the system, the air flow around the 
processor can be increased by providing an additional 
fan or increasing the output of existing fan. If this is not 
possible, baffling the airflow to direct it across the de- 
vice may help. This means the addition of sheet metal 
or objects to guide the air to the target device. Often the 
addition of simple baffles can eliminate the need for an 
extra fan. In addition, the order in which air passes 
over devices can impact the amount of heat dissipated. 


5.3 Fans 


Fans are often needed to assist in moving the air inside 
a chassis. A typical variable speed fan capable of up to 
100 CFM of air can be found for approximately $10. 


The airflow rate is usually directly related to the acous- 
tic noise level of the fan and system. Therefore maxi- 
mum acceptable noise levels may limit the fan output 
or the number of fans selected for a system. 


A fan may be placed at the top of a heat sink to pro- 
duce direct air impingement on the heat sink for 
efficient heat removal. A key issue with fans is their 
reliability. Although many fans are rated for approxi- 
mately 50,000 hours of operation, operating conditions 
such as operating temperature, pressure drop across the 
fan and the particles in the air can significantly reduce 
the fan’s useful life. 
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5.4 Thermal Performance Validation 


The recommended thermal solutions in Section 5.1-—5.3 
are only design guidelines. Since there are many vari- 
ables that can effect thermal performance in an actual 
system, thermal performance should always be validat- 
ed experimentally, following the case temperature mea- 
surement technique described in Section 4.2. 


6.0 CONCLUSION 


As the complexity of today’s microprocessors continues 
to increase so do the power dissipation requirements. 
Care must be taken to ensure the additional heat result- 
ing from the power is properly dissipated. As docu- 
mented, the heat can be dissipated using passive heat 
sinks, fans and/or active cooling devices. 


The simplest and probably most cost effective method 
is to use a heat sink and a fan. The size of the heat sink 
and the output of the fan can be varied to balance the 
tradeoffs between size and space constraints versus 
noise. For example, if space is available a 1.4” high 
heat sink can be used with only 200 LFM to cool the 
66 MHz Pentium processor and the 16W it dissipates. 
Another example in which space is restricted shows a 
0.7” high heat sink can be used if approximately 500 
LFM of airflow is provided. 


These are not the only valid solutions, but both provide 
adequate cooling to maintain the Pentium processor 
case temperature at or below the 70°C @ 66 MHz spec- 
ified limit. By maintaining this specification, the system 
can guarantee proper functionality and reliability of the 
Pentium processor. 
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APPENDIX A 
EXAMPLES 


Thermal management examples, designing with the Pentium processor. Using the best known methods. 


Appendix Goal 


The goal of this appendix is to measure the operating temperatures in a real system versus the wind tunnel laboratory 
measurements. These experiments are done with heat sinks that are similar to the ones suggested in Section 5.1 of the 
main document. The thermocouples and attachment methods suggested in Section 4.2 of the main document are also 
used. The appendix begins by reviewing the variables that the system designer has control over and uses tables to 
describe thermal resistance in the context of where the system designer can have the most effect The importance of 
the case to heat sink thermal interface and correct attachment methods are reviewed and different options given. The 
appendix proceeds to describe the system used for these tests and the tools and equipment needed. The lab set up 
procedures are discussed in detail and the measurements are presented with comments at the conclusion. 


WHAT ARE THE VARIABLES? 


Table A-1 shows the cooling options that customers can control when designing a system. From Table A-1 it is 
obvious that changing the heat sink and air flow are the two most effective ways for a system designer to affect the 
thermal performance of a system. 


Table A-1. Variables 


COOLING OPTIONS UNDER CUSTOMER CONTROL 
| Variables Options for Cooling 


Device e Use Power Management SW in the System 
@ Clock the Device at a Lower Speed 


Heat Sink © Increase Heat Sink Area, Width or Height 
e Use Interface Materials with Lower Thermal Resistance 
e Use Active Cooling Devices 
e Use a Combination of Active and Passive Cooling Devices 


Air Flow e Increase Air Flow 
® Manage Air Flow 
e Place Hottest IC’s near Highest Airflow 


Ambient Temperature e Place System in a Controlled Climate 
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® 
Figure A-1 sums up the thermal tradeoff picture succinctly. Looking at Figure A-1 reiterates the previous statement 
that increasing the heat sink size and air flow rate provide the largest thermal performance improvements. In 
addition it shows the variables that are constant. Note that the @jc (junction-to-case thermal resistance) of the 
Pentium processor is fixed and a system designer can have no effect on this parameter. Also note that the 0cg (case- 
to-heat sink thermal resistance) is a constant. Even though @c¢g is shown as a constant in Figure A-1 it can move up 


and down the Y axis depending on the interface material chosen. The case to heat sink interface is critical to the 
overall success of the thermal solution and cannot be overlooked. The next section will go into detail on this subject. 


Thermal Resistance Breakdown 


Notural/ 
Convection 


Low Air Flow 


Forced 
Convection 


Thermal 
Resistance 
(°C/W) 


200 


Air Flow (feet/minute) 
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Figure A-1. Thermal Resistance 
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The main of purpose Figure A-2 is to show that packages and heat sinks are not perfectly flat and this requires that 
the air gap be filled with an interface material that has a lower thermal resistance than air. The whole point is to try 
and minimize the contact thermal resistance. The different types of thermal interface materials are listed to show the 
wide array of materials available to the system designer. Intel’s data books have a mechanical section that list the 
flatness of the package. Heat sink vendors should be able to provide specifications for their heat sink offerings. 


Thermal Interfaces 


Thermal Interface case to heat sink 


Heat Sink 


@ interface materials are 
designed for transferring 
heat through the imperfect 
surfaces of the processor 
package and the heat sink 


m= TYPES OF THERMAL 


INTERFACE PRODUCTS ; 
Heat Sink 


= Elastomeric Pads 


= Thermal Compounds 


= Conductive Graphite 
Felt 


= Wax on Carrier Film 
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Figure A-2. Thermal Interfaces 


PRELIMINARY | 3-183 


= 


AP-480 : . | ntel . 


The next section (Figures A-3 through A-5) covers attachment methods which generally fall into the categories 
shown; epoxies, double sided tapes or manual clips to either chip or socket. 


Attachment Methods Double Side Tapes/Epoxies 


B Double sided tapes and epoxies 
are designed to either 
permanently or semi — permanently 
fasten the heat sink to the processor 
package and to promote heat 
transfer across the interface. 


m= TYPES OF ADHESIVES 


= Epoxies (1 & 2 Part) 


= Silicone Compounds 
(RTV) 

= Anerobic (1 Drop) 

= Acrylic based PSA 


Tapes 
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Figure A-3. Attachment Methods 


Attachment Methods Clipping Techniques 


@ Clips are designed to be a Clip to side of processor pkg. 
removable solution to the : RS ER ES BS Re 
processor heat sink attachment 
problem. The force generated 
minimizes c-s thermal 
resistances. 


TYPES OF CLIPS 


= Edge Plastic 
= Over-the-Top Metal 
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Figure A-4. Attachment Methods 
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Note that some clips don’t allow the package to be pushed all the way into the socket and this could be a problem 
with short lead packages. The main advantage of this type of system is that a low profile socket can be used to lower 
the height of the processor heat sink assembly. 


Attachment Method Clipping to Processor Socket 


@ Socket clips are designed to ‘ 


fasten heat sinks to processor’s which Be = 
have shortened pin lengths. se a 


= TYPE OF CLIP 


= Over-the-top Metal 
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Figure A-5. Attachment Methods 


Table A-2 lists pros and cons of the different attachment methods covered. 


Table A-2. Attachment Methods 
ATTACHMENT METHODS 


ADVANTAGES 


DOUBLE © Quick to Use 
SIDED e Low Installed Cost 
TAPES ¢ Compliant 


e Low Potential Thermal 
Resistance 
EPOXIES @ Low Contact Pressure 


¢ Centralized Pressure Points 

e Removable 

e Easily Installed 

e Solution to Upgrade 

e Accommodates Wide 
Tolerance 
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DISADVANTAGES 


e High Thermal Resistance 
e Requires Flat Interfaces 
e Assembly Contact Pressure 


@ Mixing, Curing, Messy 
e Timing Consuming (if not 
automated) 


e CTE Stress, High Rigidity 
e Variable Thickness (theta) 


e Removable 

© Force Limits vs Assembly 

@ Insufficient Shock and 
Vibration Data 

e Potential for Loss of Pressure 
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The materials chart Figure A-6 shows the performance of each type of thermal interface material. Note that even 
though thermal grease has a deserved reputation for being messy and harder to control it still performs well as a 
thermal interface. All the examples that are shown in Appendix A use thermal grease as the case to heat sink 
interface. 


Thermal Interface Materials 


“CW (sq. in.) 


Thermal Resistance (C/AW) 
Insulating 
Graphite : Elastomeric 
oe Liast ) Felt ono Mig” Elastomeric Net at atthe 
ss Performance) Pads (Adhesives 2 


Sides) = 


Dry Greases Thin Double Faced 
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Figure A-6. Materials 
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The next step is to chose a heat sink. Figure A-7 shows the wide range of choices and the cost associated with each 
technology. 


Now that all the variables and options are known for this problem we can proceed on to do some real system 
measurements using the recommendations and data shown in the first part of this application note. 


A Universe of Solutions to Thermal Management Problems 
$100.00 
2 
SSS 


\) 
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Required Thermal Resistance 
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Figure A-7. Solutions | 
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Examples 


For all the examples in this section we used a 40 MHz system with a Pentium processor and 256K cache. A picture 
of the system under test is shown in Figure A-8 with the covers off to show the placement of the Pentium processor 
and the associated cache components. A 40 MHz system was used because it was the only one available at the time 
the testing was done. 


intérface Processor Cache 
logic 
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Figure A-8. Pentium™ Processor System 
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Objectives 


To measure a Pentium processor system operating 
under real working conditions. 


To compare the measured results to the predicted 
results shown in the beginning of this application 
note. The reader should always keep the main goal 
in mind; the main goal is always to meet the case 


- temperature specification for the Pentium proces- 


sor. Any combination of heat sink and air flow rate 
is fine as long as the case temperature specification 
is met. The heat sinks used in test #1 thru #4 will 
match the suggested heat sinks as close as possible 
to accurately correlate with the wind tunnel data. 
This is meant to illustrate how a system designer 
might start by using the suggested heat sinks and air 
flow rates as starting points to thermally tune their 
particular system. Test #5 uses a heat sink and a 
fan combination. The fan heat sink is best described 
as a fan attached directly to the heat sink on the 
Pentium processor. It is an active device used for 
spot cooling ICs. We will concentrate on traditional 
passive heat sink solutions with only one set of mea- 
surements being done for a fan heat sink assembly. 


Tools and Equipment 


bs 


Pentium processor-based system running at 
40 MHz. 


. Hot wire anemometer to measure airflow rate. 
. Thermocouples and high thermal conductivity ce- 


ment as recommended in the application note. 


. Homemade jig for accurate and repeatable attach- 


ment of the thermocouples to the package. 


. Homemade power supply isolation socket for set- 


ting the Vcc and reading the Icc of the processor 
independently of the rest of the system. 


. Adjustable power supply with adequate current ca- 


pabilities and both current and voltage read out. 


7. Multimeter to read the voltage and current. 


10. 
il. 


. Cables to connect everything up. 
. Software test suite that simulates “worst case con- 


ditions for a typical real application.” In this case it 
was Microsoft Excel and Word for Windows test 
suites. 


Drill and drill bits. 
Thermal grease. 


The lab procedure was as follows: 


Preparing the System 


l. 


Load the test software on the system disk (or floppy) 
and make sure everything runs correctly before you 
start. After everything works satisfactorily proceed 
to the next step. 


. Remove the covers, choose several places (random) 


around the processor to measure the air flow of the 
system. Then drill holes large enough to allow the 
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anemometer to be inserted. Five holes were drilled 
in the system cover. 


3. In this case we had a 12” long */,” diameter direc- 
tional anemometer. To get more repeatable mea- 
surements the shaft of the probe was marked with a 

‘pencil to get the same depth, into the box, for each 
measurement 


4. We then removed the processor card from the chas- 
sis (use anti-static procedures to prevent IC dam- 
age). 

5. Remove the Pentium processor from the card and 
install the isolation socket. 


Preparing the Pentium Processor for Testing 

1. Using the jig carefully attach the thermocouple to 
the center of the processor package using cement 
and let it cure as recommended by the manufacturer 
of the cement. 


2. Drill holes no larger than 0.125” in the centers of 
the heat sinks to be tested just large enough to get 
the thermocouple wires through the hole. In the 
case of the fan heat sink, the fan was removed and 
the heat sink was drilled the same as the others and 
then re-assembled. Each of the holes were counter 
sunk on the bottom to better conform to the tear 
drop shape the thermocouple and cement naturally 
forms into. The idea is to not disturb or break the 
contact between the cement and the package. If it is 
broken or cracked the measurements will be incor- 
rect 

3. Apply the thermal grease (less than 0.004” thick) 
evenly, with no voids, to the processor package. 

4. Slide the heat sink down the thermocouple wires 
being careful not to disturb the thermocouple while 
at the same time firmly seating the heat sink to the 
package. Attach the plug for the temperature meter 
to the other end of the thermocouple wire terminals. 

5. Re-install the processor/thermocouple/heat sink as- 
sembly into the isolation socket on the processor 
board, again being careful not to disturb the thermo- 
couple connection. 


Preparing for Measurements 
1. Re-install the processor card into the system. 


2. Connect the power supply wires to the power supply 
and the isolation socket. 


3. Connect the multimeter to the the power supply to 
monitor the Vcc and set the power supply meter to 
measure Icc. 


4. Connect the thermocouple to the meter. 


5. Turn on the processor power supply and then the 
system supply. 


6. Wait for the system to boot and then run the test 
software. 
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Thermal Measurements 


The next step was to determine the baseline airflow in 
the system without a heat sink attached to the proces- 
sor. Measure the airflow at several locations using the 
access holes in the system and the marks on the probe 
to ensure accurate placement of the probe and repeata- 
bility of the measurements. Table A-3 shows the re- 
sults. Be cautious when placing the fan in a system 
relative to the processor. All fans have a dead spot (low 
airflow) in the center of the fan. Avoid the dead spot. 
Even several inches away from the fan the dead spot 
can influence airflow considerably. 
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The next step is to compare how close the suggested 
values and tables are to the measured results. Use the 
formulas described in the beginning of the application 
note and the values from Table A-4. 


Test #1 


Pp = Voc * Ioc = (1.82"4.89) = 8.827W 
Osc = (Ty—Tc)/Pp = 0.6 
Oya = (Ty-Ta)/Pp 
Oca = 9ja—95c = [(Ty-Ta)—(Ty—To)]/Pp 
= [Tco-Ta]/Pp 
Oca = (55.3 — 29)/(1.8*4.85) = 24/8.827 = 2.97 


6ca = 2.7°C/W is the measured value in the system 
for this configuration. 


Table A-3. Baseline Airflow 


Airflow Measured, LFM 


~ 2inches (upstream from the fan) @ center of the processor (above the heat 


sink) 


Heat Sink Size, 
inches 
Hx W 
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Air Flow 
LFM 


[405 [100-160 
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The graph (Figure A-9) from the application note, for 
heat sink size size of 2.1” x 2.1”, is used to compare the 


predicted 9c,, for a 1.2” tall heat sink, to the mea- 
sured value of Oca. 
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The predictions from the graph (Figure A-9) are: 


Oca = 2.9°C/W @ 100 LFM 
Oca = 2.4°C/W @ 150 LEM 
@ca = 1.9°C/W @ 200 LFM 


And the measured value is: 


8ca = 2.97°C/W @ 100-150 LFM 


Heat Sink Size (Base) = 2.1 Inches Square 
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Figure A-9. Thermal Resistance 


Table A-5. apa Conditions Test #2 


Heat Sink Size, 
Inches 
H x W 


05 x 2.1 sd. 
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Temperature, | Temperature,degreesC Cc 


lec Air Flow 
ie System Case Amps LEM 
Ta Tc 


[ze | 2 | sea | 101 | aes | 100-160 
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Test #2 


The same test only the heat sink height was reduced to 
0.5 inch height. 


8ca = (To — Ta)/Pp 
Oca = 58.3 - 29/8.79 = 3.33°C/W 


The 2. 1” x 2.1” graph (Figure A-9) from test #f is 


used again and it predicts: 


Oca = 4.9°C/W @ 100 LFM 
Oca = 4.3°C/W @ 150 LFM 
Oca = 3.8°C/W @ 200 LFM 


And the measured value is: 


Oca = 3.33°C/W @ 100-150 LFM 


Test #3 


This test is the same as test #2 except that processor 
board to board spacing was reduced to 0.6 inches using 


intel. 


a cardboard baffle to simulate a system with very tight 
board spacing. An existing system that is upgrading 
from an Intel 486 processor to the Pentium processor 
might have this type of spacing. Note that this particu- 
lar configuration actually has more airflow than test 
#2. It could have just as easily been lower. It all de- 
pends on the particular system being measured. 


8ca = (Tc — Ta)/Pp 
Oca = 70.3 - 27/8.79 = 4.9°C/W 


The 2.1 ” x 2.1” graph (Figure A-9) from test #1 is 
used again to predict the 0c,: 


Oca = 4.3°C/W @ 150 LFM 
Oca = 3.8°C/W @ 200 LFM 
Oca = 3.0°C/W @ 300 LFM 


And the measured value is: 


Oca = 4.9°C/W @ 175-200 LFM 


ge ea A-6. Test Conditions Test #3 


Heat Sink Size, 
Inches 
Hx W 


| Temperature,degreesC degrees C 
Icc Air Flow 
Ta 


| osx2tsa | 23 ar es | | a 


175- | 175-200 | 


mee eros A-7. Test Conditions Test #4 


Heat Sink Size, 
Inches 


Hx W 


0.65 x 3.1 sq. 
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| ‘Temperature,degreesC degrees C 
loc Air Flow 
oer 


| 100-140 140 
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Test #4 


This test uses a 0.65” tall heat sink that is 3.1” sq. This 
type of heat sink might be used when height is limited 
and there is room to spread out by adding more area to 
the heat sink base. 


"cn > (lo - Taleo 

Oca = (55.3 - 29)/8.73 = 3.0 
The 3.0” x 3.0” graph (Figure A-10) from the applica- 
tion note is used since it is similar to the heat sink used. 
The 3.0” x 3.0” graph predicts: 


Oca = 3.0°C/W @ 100 LFM 
Oca = 2.6°C/W @ 150 LFM 


And the measured value is: 


9ca = 3.0°C/W @ 100-140 LFM 
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Test #5 


The last test was done using a fan/heat sink assembly 
that has become popular for prototyping, debug and 
spot cooling in some situations. We were not able to 
measure the airflow on the processor with this configu- 
ration because the air flow is not directional enough to 
get a reading with the probe available. The case temper- 
ature however was monitored by mounting a thermo- 
couple in the same manner used above. We did modify 
the setup by bringing the thermocouple wires out the 
side to clear the fan. This will change the measurements 
the thermocouple produces and should be factored into 
any data. We do not have any wind tunnel data on the 
fan/heat sink combination. Note that the case tempera- 
ture is within specification. 


Heat Sink Size (Base) = 3.0 Inches Square 


= 
O 
5 
2 
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Airflow 
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Figure A-10. Thermal Resistance 


Table A-8. Test Conditions Test #5 


Heat Sink Size, 
Inches 
Hx W 


HS/Fan 


Ta 
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loc Air Flow 
LFM 


120-160 


3-193 


AP-480 


Conclusion 


Table A-9 shows all the tests in one table. The data 
shows that the suggestions in the application note are a 
very good starting point to begin tuning any Pentium 
processor system and that there is no one cookbook 
answer that fits all systems because of the complexity of 
air flow and variations from each type of system. In- 
deed the results show that airflow can be changed dra- 
matically even in the same system by changing one 
variable. For example test #2 and #3 are exactly the 
same except that board to board spacing was reduced 
significantly. Note that case temperature rose signifi- 
cantly even though the airflow sensor was reading a 
higher value. This suggests that the airflow through the 
heat sink was lower even though the anemometer, 2 
inches away, was reading higher airflow at its position. 
Note also that test #2 more closely approximates the 
wind tunnel test setup because it has open space above 
the board instead of a board nearby. This is also why 
the predicted data versus the measured data is so far off 
for test #3, while test #2 is very close to the predicted 
results. 


Test #1 and #4 demonstrate a fundamental principle 
of the physics involved. If you have the same airflow 


6 ! ; 
intel. 
and must reduce the height of the heat sink, you have 
to spread out the area of the heat sink to compensate 
for the reduced height. Test #1 uses a 1.2” height heat 
sink that is the same size as the package. Test #4 was 


able to produce the same case temperature with a short- 
er heat sink and more area. 


Test #5 demonstrates that a fan/heat sink assembly 
can spot cool effectively if you have enough space above 
and around it to allow the required back pressure. This 
is the only active device tested. If you look back at the 
“A Universe of Solutions to Thermal Management 
Problems” (Figure A-7) chart you will see the reason 
why. While the Pentium processor is at the outer enve- 
lope of passive cooling, this method of cooling still of- 
fers lower cost, power usage and reliability in most cas- 
es. 


Most of all the system designer should never lose sight 
of the real goal which is to keep the junction tempera- 
ture within the operating limit. Since the designer can- 
not measure junction temperature they must use the 
case temperature, which can be measured to ensure 
proper operation for the component. 


Table A-9 


System 
Ta 
27 


Voc = 4.85V for all tests. 


3-194 


PRELIMINARY : 


F ntel . | | | | | AP-480 


APPENDIX B 
HEAT SINK VENDORS 


Aavid Engineering IERC 
One Kool Path 135 W. Magnolia Blvd. 
P.O. Box 400 Burbank, CA 91502 
Laconia, NH 03247 (818) 842-7277 
(603) 528-3400 (818) 848-8872 (FAX) 
(603) 525-1478 (FAX) Contact: Guy R. Addis (Western Region Applications 
Contact: Gary F. Kuzmin (Product Marketing Engineer) 
Manager) 
Thermalloy 
EG&G Wakefield Engineering 2021 W. Valley View Lane 
60 Audubon Rd. Dallas, TX 75234-8993 
Wakefield, MA 01880 (214) 243-4321 | 
(617) 245-5900 Contact: Larry Tucker (VP of Sales and Marketing) 


(617) 246-0874 (FAX) 
Contact: David Saums (Marketing Manager) 
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1.0 INTRODUCTION 


The Pentium™ processor, 82496 cache controller, and 
82491 cache SRAM CPU-Cache Chip Set has been de- 
signed to take advantage of the high performance avail- 
able from the 66-MHz operating frequency. In addi- 
tion, it has been designed to obtain this performance 
without severely adding to the complexity of the system 
design. These benefits are accomplished by dividing the 
chip set into two interfaces. The first is the External 
Interface which is the interface between the CPU- 
Cache core and the rest of the system. It consists of the 
memory bus and the memory bus controller. This inter- 
face has been designed to operate at a fraction of the 
CPU’s frequency or asynchronous to the CPU. These 
options simplify the system design by minimizing the 
portions that must deal with the high frequency signals. 
The second interface is the Optimized Interface which 
is between the Pentium processor and the 82496 cache 
controller and 82491 cache SRAM. This interface is 
tuned for the known configuration options of the: chip 
set and includes specially designed input and output 
buffers optimized for the defined electrical environment 
of each signal path. The specification of the optimized 
interface is defined to accommodate the tuning in- 
volved. 


The purpose of this application note is to provide addi- 
tional explanation of the steps involved in completing a 
chip set design. It includes: 


1. Describing the specifications and how they should be | 


used. 
2. Describing Intel’s I/O buffer models. 


3. Describing the characteristics of printed circuit 
boards and transmission lines. 


4. Stepping through an example of using the informa- 
tion and tools to complete the layout of one signal 
and verify that it meets the published chip set specifi- 
cations. 


In addition, all of the information associated with a 
completed chip set optimized interface design example 
is included. This information allows system designers to 
minimize their effort in implementing the same or simi- 
lar design. 


2.0 A/C SPECIFICATIONS OF THE 
CHIP SET 


The AC/DC specifications for the chip set are pub- 
lished in the latest revision of the Pentium™ Processor 
User’s Manual Vol. 2: 82496 Cache Controller and 
82491 Cache SRAM Data Book. It includes the specifi- 
cations for both the optimized and external interfaces. 
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2.1 Optimized Interface 


The optimized interface is the high-performance inter- 
connect between the Pentium processor and the 82496 
cache controller and 82491 cache SRAM. The input 
and output buffers have been tuned for the defined con- 
figuration and electrical environment (loading, etc.). 
This tuning is what allows the chip set’s CPU-Cache 
core to operate synchronously at 66 MHz. 


There are two types of specifications for signals in the 
optimized interface. The first is Flight Time which is 
used to guarantee that signal timings are met. The sec- 
ond is Signal Quality to guarantee reliable operation. 
I/O buffer models have been provided as a tool to en- 
sure these specifications are met. In this section, flight 
time and signal quality will be discussed. I/O buffer 
models will be left for the next section. 


2.1.1 FLIGHT TIME SPECIFICATION 


2.1.1.1 Purpose of Flight Time Specification 


The purpose of the flight time specification is to guar- 
antee that a signal supplied by a driving component is 
available at the receiving component for sampling. 


It replaces the output valid delay and input setup time. 
The two methods are analogous, except that flight time 
allows the input and output buffer behavior to be 
matched without major impact to designers or the doc- 
umentation. In other words, if component A’s output is 
slower than expected, but component B’s input is faster 
than expected, these two can be traded off without hav- 
ing to change the flight time specification. However, if 
output valid delay and input setup time specifications 
had been used the specifications would have to be 
changed. Note that in both cases the time available to 
the system designer to move the signal from one com- 
ponent to the other is the same. 


2.1.1.2 Definition of Flight Time 


Flight Time is the propagation delay of a signal from a 
driving component to any receiving component. It is 
defined as the time difference between the Vcc/2 (50%) 
level of an unloaded output signal and the Vec/2 (50%) 
level of a receiving signal whose 50% Vcc to 65% Vcc 
rise time is greater than or equal to 1V/ns. Figure 1 
shows the flight time measurement between the 50% 
Vcc points on the unloaded driver and receiver wave- 
forms. 
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If the rise time between the 50% Vcc and 65% Vcc 
points is less than 1V/ns, the determination of flight 
time is slightly more difficult and requires more calcu- 
lation. In this case the 65% Vcc point is extrapolated 


Signal Level 


65% Vecf- - 


50% Vcc 


35% Vcc 


At Receiver Pin 


] 

intel. 
back to the 50% Vcc point using a 1V/ns reference 
slope (i.e., subtract 0.75 ns when Vcc = 5V). Figure 2 


shows the extrapolation from the 65% Vcc point and 
the resulting flight time measurement. 
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Figure 1. Flight Time Measurement 
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Figure 2. Flight Time Extrapolated from 65% Point 


Thus flight time is the longer of T50-50 or Textrapolat- 
ed. Although Figure 1 and 2 only show low-to-high 
transitions, flight time is the worst case of low-to-high 
and high-to-low transitions. Note on high-to-low tran- 
sitions, 65% Vcc is replaced with 35% Vcc. 


In a system environment it will not usually be possible 
to measure the delay of an unloaded driver. Figure 3 
shows the method for measuring flight time in a system 
environment. As shown, the voltage measured at the 
pin of the loaded driver will have a ledge near the cen- 
ter of the transition. According to Transmission Line 
Theory, the time required to reach half the voltage level 
of the ledge is equivalent to the time required for an 
unloaded driver to reach the 50% Vcc level. The oscil- 
lation (if any) seen at the ledge defines the measure- 
ment uncertainty for this technique. 


PRELIMINARY 


To measure flight time via this technique, first measure 
the maximum and minimum voltages of the ledge and 
take the average of these two values, (Vmax + Vmin)/2, 
to arrive at the ledge voltage. Finally, divide the ledge 
voltage by two, (Vmax + Vmin)/4. The result is the 
voltage level that approximately corresponds to the 
point in time at which an unloaded driver’s signal 
would reach the 50% Vcc level. The flight time is de- 
termined by measuring the difference in time between 
the (Vmax + Vmin)/4 point and the extrapolated 50% 
point on the receiver, Textrapolated. The uncertainty 
of this technique is the time difference between the 
Vmin/2 and Vmax/2 points. 


NOTE: 
Figure 3 uses Textrapolated. This assumes the rise 
time between 50% Vcc and 65% Vcc is less than 
1V/ns. If the rise time is equal to or greater than 
1V/ns, the flight time should be measured to the 50% 
Vcc point, T0O-50. 
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Figure 3. In-System Measurement of Flight Time 


2.1.1.3 Clock Skew 


Clock skew has generally been included in system de- 
sign timing analysis. However, as frequencies increase, 
controlling clock skew becomes more important. Clock 
skew is the difference in time of the clock signal arriv- 
ing at different components. It is measured at 0.8V, 
1.5V, and 2.0V. 


In synchronous devices, the clock signal defines the 
point in time in which signals are driven or sampled. It 
is important that all devices have a common reference. 
If the reference varies from component to component, 
the difference must be accounted for to ensure the de- 
vices function properly. In other words, clock skew 
must be subtracted from the clock period when per- 
forming a timing analysis of a system. 


In the CPU-Cache Chip Set, the maximum clock skew 
between components in the optimized interface is a 
specification. This specification is required as a comple- 
ment of flight time to ensure proper functionality. If 
clock skew exceeds the specified limit, the excess must 
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be subtracted from the available flight time or the clock 
period must be increased. 


2.1.2 SIGNAL QUALITY SPECIFICATIONS 


Acceptable signal quality must be maintained over the 
entire operating range to insure reliable operation of the 
chip set. Signal quality consists of four parameters: 
Over/Undershoot, Time Beyond Supply, Ringback, 
and Settling Time. Figure 4 illustrates these signal qual- 
ity parameters and how each is measured for a low-to- 
high transition. In addition to the absolute maximums 
associated with individual signals, each of the signal 
groups defined in the specification must meet maxi- 
mum group average of Over/Undershoot and Time Be- 
yond Supply. The following sections explain each of 
these in more detail. 


Reliable operation means the signals are sampled cor- 
rectly, do not exhibit false transitions, and that the 
long-term reliability of the component is not effected by 
overdriving the inputs. 


PRELIMINARY 


Signal Level 


At Receiver Pin 


AP-481 


Maximum Overshoot (Volts) 


FA a 


I a 


Its) Voc +/- 10% 


Setting Time 


241576-4 


‘Figure 4. Signal Quality Parameter Measured for Low-to-High Transitions 


2.1.2.1 Overshoot 


Overshoot is the maximum absolute voltage a signal 
extends above Vcc or below Vss at the pin of the receiv- 
ing component. Figure 4 shows the above Vcc case of 
overshoot. The overshoot specification is defined for 
use in simulation and assumes that input diodes are not 
included in the input model during simulation. 


The overshoot specification maintains signal reliability 
by limiting the amount of energy that is injected into 
the component. Excessive energy being driven into the 
component can cause both short-and long-term reliabil- 
ity problems. They include Vcc or Vss plane shifts, 
electromigration, excessive ringback when the input di- 
odes turn off, etc. 


PRELIMINARY 


2.1.2.2 Time Beyond Supply 


Time beyond supply is the maximum time a signal ex- 
ceeds Vcc or Vss at the pin of the receiving component 
as shown in Figure 4. If the overshoot voltage is less 
than or equal to 0.5V, time beyond supply can be ig- 
nored. When time beyond supply is being ignored, a 
value of 0 ns should be used for that signal in calculat- 
ing the group average time beyond supply. 


Time beyond supply is a complement to overshoot 
when looking at signal quality. The time the signal is 
beyond the supply is a second factor in limiting the 
energy driven into the component. By limiting the time 
beyond supply, system designers are avoiding or mini- 
mizing the risk of the same reliability issues as de- 
scribed in the section on overshoot. 
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2.1.2.3 Ringback 


Ringback is the maximum absolute voltage at the pin of 
the receiving component below Vcc or above Vss rela- 
tive to Vcc or Vss after the signal has reached its maxi- 
mum voltage level. Figure 4 illustrates how to measure 
ringback. 


Eliminating ringback maintains signal quality by pre- 
venting a signal from re-crossing the threshold, causing 
a false transition to be detected. 


2.1.2.4 Settling Time 


Settling Time is the time required for a signal to settle 
within 10% of its final value referenced from the time 
unloaded driver’s initial crossing of the 50% Vcc 
threshold. Figure 4 shows how settling time is mea- 
sured on both low-to-high and high-to-low transitions. 


The settling time specification is defined to ensure that 
a signal transition has completed and is no longer oscil- 
lating prior to the next transition. This is important to 
avoid forcing a signal to transition a distance signifi- 
cantly greater than Vcc/2. For example, if a signal is 
still not settled at the time of its next transition, it may 
be at a voltage above Vcc such as 6.5V. Assuming Vcc 
= 5V, the transition requires the voltage to swing from 
6.5V (instead of 5V) to 2.5V. This added voltage dis- 
tance translates into added flight time. 


2.1.2.5 Group Averages 


A Maximum Group Average specification is defined 
for Overshoot and Settling Time for each signal group. 
The group average is calculated by summing the maxi- 
mum overshoot level or settling time for each signal in 
the group and dividing by the number of signals in the 
group. 


The purpose of the group average specifications is to 
limit the amount of energy driven into the device. 
While each input can handle a certain amount of ener- 
gy as limited by the Overshoot and Time Beyond Sup- 
ply specifications, the overall part or portions of the 
part also have limits. These limits are maintained by 
ensuring the energy driven into these portions does not 
exceed a certain level. 


2.2 External Interface 


The External Interface is the interface between the 
CPU-Cache core and the rest of the system. It consists 
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of the memory bus and the memory bus controller. 
This interface has been designed so that it can operate 
at a fraction of the CPU’s frequency or asynchronous 
to the CPU. These options simplify the system design 


by minimizing the portions that must deal with the 
high frequency signals. 


The specifications for the external interface are defined 
to allow system designers to connect the CPU and 
Cache components to other devices (ASICs, PLDs, 
memory, etc.). The external interface signal specifica- 
tions are the more traditional output valid delay and 
float time and input setup and hold time. In addition, 
I/O buffer models have been provided as a tool to assist 
system designers. : 


2.2.1 OUTPUT VALID DELAY AND FLOAT TIME 


Output Valid Delay is the amount of time it takes the 
signal to transition referenced from the clock edge. The 
maximum output valid delay is the earliest point in 
time that the signal can be assumed valid at the pin of 
the device. This timing is referenced from the clock 
edge and is measured at 1.5V as illustrated in Figure 5. 


NOTE: 
This specification is defined assuming Cy = O pF; 
therefore, system designers must account for all delay 
added by the signal traveling to the receiving device. 


The maximum output valid delay is used by system 
designers to perform a worst case timing analysis to 
ensure that setup times are met at receiving devices. 


The minimum output valid delay is the earliest valid 
data from the previous clock will begin transition after 
the clock edge. This timing is also referenced from the 
clock edge and is measured at 1.5V as illustrated in 
Figure 5. 


The minimum output valid delay is used by system de- 
signers to perform a worst case timing analysis to en- 
sure that hold times are met at receiving devices. In 
addition, on I/O or tristate pins, it is used to ensure no 
bus contention exists due to multiple devices driving 
the bus simultaneously. 


The maximum Output Float Time is the amount of 
time it takes to float a signal that was driven in the 
previous clock. This timing is also referenced from the 
clock edge and is illustrated in Figure 6. Note that the 
minimum valid delay determines how long data from 
the previous clock remains valid as shown in Figure 6. 
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Tx = valid delay 
Figure 5. Output Valid Delay 
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max. float delay 
min. valid delay 
max. valid delay 


Figure 6. Output Float Time 


2.2.2 INPUT SETUP AND HOLD TIME The input setup time is used by system designers to 

perform a worst case timing analysis to verify that fast 
Input Setup Time is the amount of time a signal must enough drivers have been chosen for ASICs or other 
be valid at the component’s input pin prior to the clock devices and that the signals are able to travel across the 
edge it is sampled. The minimum input setup time is board layout in the allotted amount of time. 


the latest point in time that the signal can be assumed 
valid at the pin of the device. This timing is referenced 
to the clock edge and is measured at 1.5V as illustrated 
in Figure 7. 
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Figure 7. Input Setup and Hold Time 


Input Hold Time is the amount of time a signal must be 
valid at the component’s input pin after the clock edge 
is sampled. The minimum input hold time is the earliest 
point in time that the signal can start its next transition 
at the pin of the device. This timing is referenced to the 
clock edge and is measured at 1.5V as illustrated in 
Figure 7. 


3.0 I/O BUFFER MODELS 


3.1 Description of the First Order I/O 
Buffer Model 


The first order I/O buffer model is a simplified repre- 
sentation of the complex input and output buffers used 
in the Pentium processor, 82496 cache controller, and 
82491 cache SRAM. Figure 8 shows the structure of 
the input buffer model and Figure 9 shows the output 
buffer model. Tables 1 and 2 show the parameters used 
to specify these models. 
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Figure 8. First Order Input Buffer 


Table 1. The Parameters Used in the Specification of the First Order Input Buffer Model 


3-206 


cn | Minimum and maximum value ofthe capactance ofthe nput bufermodel 
[tp | Minimum and maximum veh ofthe packageindutance 
[cp | Minimum and'maximum vale ofthe package capactanco 
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Figure 9. First Order Output Buffer 


Table 2. The Parameters Used in the Specification of the First Order Output Buffer Model 


The first order I/O buffer parameters for the chip set 
are published in the latest revision of the Pentium™M 
Processor Data Book. \t includes the minimum and 
maximum values for each parameter allowing simula- 
tions for both the fast and slow condition to be per- 
formed. 


The key to the first order model is that it is a simplistic 
representation of the input and output buffers. The pa- 
rameters are easy to use in a variety of simulators and 
provide the needed accuracy to complete a chip set de- 
sign. In addition, the simplicity greatly reduces the 
compute time required to perform simulations as com- 
pared to full transistor and process models. 


4.0 HIGH FREQUENCY DESIGN 
CONSIDERATIONS 


Any board interconnection is a transmission line by 
definition. However, as a general rule, the effects of 
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Description 


dV/dt Minimum and maximum value of the rate of change of the open circuit voltage source used 
in the output buffer model 


Minimum and maximum value of the output impedance of the output buffer model 


Minimum and maximum value of the capacitance of the output buffer model 


Minimum and maximum value of the package inductance 


Minimum and maximum value of the package capacitance 


modeling an interconnect/trace as a transmission line 
are negligible at low frequencies. This is because the 
reflections get masked since the propagation is short 
with respect to the signal rise time. In this case, system 
interconnects can be modelled as lumped loads with no 
sacrifice to accuracy. As the frequency at which signals 
change increases, the rise time becomes shorter and 
transmission line properties become significant and 
must be considered. Reflections, interference, and noise 
cause measureable changes in the appearance or behav- 
ior of signals at the higher frequencies. They can slow 
the transition time or cause signal quality violations. 
Figures 10 and 11 illustrate the effect of modeling 
traces as lumped loads or as transmission lines. Figure 
10 shows a block diagram of a lumped load model and 
the resulting simulation and Figure 11 shows a block 
diagram of a transmission line model and the resulting 
waveform. The transmission line case shows the effects 
of reflections and how the signal varies at each compo- 
nent. These details are not shown in the lumped load 
case. 


3-207 


AP-481 ‘rs intl. 


Pentium(tm) 
Processor 


241576-10 


Simulation Assuming Lumped Load 
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Figure 10. Lumped Load Simulations 
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Figure 11. Transmission Line Simulation 


To predict a trace’s behavior and eliminate the negative driving component’s output buffer model, the charac- 
effects, it is important to properly model board inter- teristics of the trace, and the receiving component’s in- 
connects as transmission lines. A model consists of the put buffer model. The I/O buffers are modelled using 
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the parameters published in the latest revision of the 
Pentium™ Processor User’s Manual, Vol. 2: 82496 
Cache Controller and 82491 Cache SRAM Data Book. 
The trace characteristics are a function of the actual 
board and the material used in its construction. Char- 
acteristic impedance (Zo) and propagation delay (Tpd) 
are the primary characteristics needed. These two pa- 
rameters, along with length, can be used in many simu- 
lators to represent the transmission line as shown in 
Figure 12(a). 


Some simulators may not have a single representation 
for a transmission line. These simulators require the 
user to model the transmission line in a more basic 
form. In addition to characteristic impedance and prop- 
agation delay, transmission lines can be characterized 
by their characteristic inductance (Lo) and characteris- 
tic capacitance (Co). The following equations can be 
used to convert between these four parameters: 


Lo = Zo* Tpd 
Co = Tpd/Zo. 


Zo, on | 


a 

intel. 
Figure 12(b) illustrates the model of a transmission line 
using Co and Lo. The number of L-C links/inch in the 
chain should be chosen such that (LC)!/2 < Tr. Asa 
good rule of thumb, two to four links per inch is a good 
starting point. The value of L and C is determined by 
dividing the characteristic impedance by the number of 
links/inch. A transmission line also has a resistive com- 
ponent, Ro. However, the Ro value is usually assumed 
negligible and omitted from the model. Its only effect is 
to cause a voltage loss between the driver and the re- 
ceiver. Since Ro’s value is negligible, the voltage loss is 
very small and can also be ignored. 


L = Lo/n 
C = Co/n 
n = number of links/inch 
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Figure 12. Representation of a Transmission Line 


In order to complete the model, the actual parameter 
values must be determined. That means the next step is 
to determine the values of Zo and Tpd. To do this the 
designer must understand the construction of the trans- 
mission line. In particular that means understanding 
how the printed circuit board is constructed and the 
geometry of the various layers and traces. 
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4.1 Printed Circuit Board 


A printed circuit board consists of some number of sig- 
nal layers and power and ground planes separated by a 
dielectric material. An example is illustrated in Figure 
13. 
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Figure 13. Printed Circuit Board 


sist of a signal trace that is separated from a power or 


ground plane by a dielectric as shown in Figure 14. 


Although there are many types of transmission lines, 
those most commonly used on printed circuit boards 
are microstrip lines and striplines. Microstrip lines con- 
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Figure 14. Microstrip Trace 
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For a microstrip 


The characteristic impedance is a function of the dielec- 


tric constant and the board geometry. 


trace this is given by: 
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where e; is the relative dielectric constant of the board 
material. The propagation delay is a function of the 
dielectric constant only. For a microstrip trace this is 
given by: 


tpd = 1.017 (0.475 ¢, + 0.67)1/2 ns / ft 


intel. 


Striplines consist of a signal trace that is located in a 
dielectric material between two power or ground planes 
as shown in Figure 15. 


Stripline 
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Figure 15. Stripline Trace 


The characteristic impedance is a function of the dielec- 
tric constant and the board geometry. For a stripline 
trace this is given by: 


e 


4 
in eas: EERE Ohms 


(€r) ~ 0.67 a(t + 0.8W) 


where e, is the relative dielectric constant of the board 
material. The propagation delay is a function of the 
dielectric constant only. For a stripline trace this is giv- 
en by: 


tpd = 1.017 (.,)1/2 ns/ft 


Signal traces that are located on the external layers of 
the board can be modelled using the microstrip charac- 
teristics. Signal traces that are located on internal layers 
between the power and ground planes can be modelled 
by using the stripline characteristics. 


4.2 Transmission Line Behavior 


Now that the parameters needed to represent a trans- 
mission line are complete, the next step is to look at 
how the transmission line behaves or effects signals that 
are transmitted on it. The four primary effects are sig- 
nal propagation and reflection, crosstalk, and skew. 
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4.2.1 SIGNAL PROPAGATION AND 
REFLECTION 


Ideally, a signal that is driven by one device to another 
across a trace will reach the receiving device instantly 
and without any distortion. In reality, this is not the 
case. Because of the resistive, capacitive, and inductive 
components of a transmission line and the attached 
loads, the voltage and current of a signal may change as 
the signal travels along the trace and may vary over 
time. 


For simplicity, the examples that follow will assume 
lossless transmission lines. In other words, the trans- 
mission line itself does not cause an attenuation of the 
voltage as it travels along the trace. The voltage loss 
seen at the receiving end is a function of the resistive 
component of the transmission line which is negligible. 


An example of a signal propagating along a transmis- 
sion line is shown in Figure 16. The voltage source at 
the left launches a wave onto the transmission line. The 
voltage of the wave is equal to the voltage division of 
the voltage source resistance and the line inductance. 


Vi = Vin * Zo/(Zo + Rs) 
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This waveform is propagated down the transmission 
line without any voltage loss or change in the wave- 
form. The only effect of the transmission line is to delay 
the signal from reaching the receiving end. When the 
wave reaches the receiving end, it is reflected back 
towards the source. The reflection is proportional to the 
initial wave by an amount called the Reflective Coeffi- 
cient. A reflective coefficient exists for both the source 


and the load. The values are a function of the source or 
load resistance and the lines characteristic impedance. 


pc = (R, — Zo)/(R, + Zo) 
Ps = (Rg — Zo)/(Rg + Zo) 


The value of the initial reflection at the load is: 


Vr = Vi* py 
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The reflections can be caused by any discontinuity on 
the line. The discontinuity can be caused by a mismatch 
in impedance between the source or load and the char- 
acteristic impedance of the trace, branches in the trace, 
vias, or bends and angles in the trace. Here the disconti- 
nuity between the source and load are used as an exam- 
ple because they are probably the most prominent. 
Each reflection can attenuate or reinforce the wave de- 
pending on the phase of the reflection. The reflections 
continue indefinitely; however, with each reflection the 
magnitude of the voltage decreases and the line ap- 
proaches a steady state value. A rule of thumb is that 
by the third or fourth reflection the value is negligible. 


Signal (Wave) Propagation (Resistive Loads) 


Rs 


Initial Wave 
Vin 


Reflected Wave 


Now that the voltage of each reflected waveform can be 
calculated, the next step is to sum these values to deter- 
. mine the voltage measured on the line at any point in 
time. The superposition principle comes into play. It 
states that the voltage/current at any point on a trans- 
mission line equals the sum of the voltages/currents of 
all the signals (waves) that have passed that point. In 
other words, as each reflection passes the point of mea- 
surement, it is added to the previous voltage seen at 
that point. Figure 17 illustrates the voltage seen at the 
midpoint of the circuit shown in Figure 16 over time. 
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Figure 16. Signal Propagation Along a Transmission Line with Resistive Loads 


L— > Vi=Vin 


Zo +Rs 
(Voltage Divider) 


Vin 


(Reflective Coefficient) 
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v.__- Voltage at Line’s Center Point 


Vi + Vr 


Vi 


Td 3Td/2 
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Figure 17. Voitage at the Midpoint 
of Circuit in Figure 16 
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From time 0 until Td/2 the voltage is OV at the mid- 
point because the initial wave has not yet reached the 
midpoint. At time Td/2, the initial wave reaches the 
midpoint and the voltage is Vi. This wave travels down 
the trace until it reaches the load and a reflection oc- 
curs. The reflection begins traveling back towards the 


a ‘ \ 


a 
intel. 
source, but does not reach the midpoint until time 3Td/ 
2. At that time the voltage increases by Vr, the reflec- 
tions voltage as shown in Figure 17. The following 


equation determines the voltage at any given point on a 
trace at the given time. 


V(x,t) = [Zo / (Zo + Zs)] *{ Vin [t — (t — tpd * x)] * U(t — tpd * x) 
+ p. * Vin [t — (t — tpd * (2L — x))] * U(t — tpd * (2L — ») 
+ pL * ps * Vin [t — (t — tpd * (2L + x))] * U(t — tpd * (2L + x) 
+ p_2* pg Vin [t — (t — tpd * (4L — x))] * U(t — tpd * (4L x) 
+ pL? * ps2 * Vin [t — (t — tpd * (4L + x))] * U(t — tpd * (4L + x) 
wat 


U(x) = unit step function 


Tpd = propagation delay of signal traveling along the transmission line (ns/ft) 


As reflections occur on the line they can cause slower 
signal transitions, overshoot, undershoot, ringing, and 
other undesirable effects. Although many of the effects 
of reflections are negative, sometimes designers take ad- 
vantage of constructive reflections to decrease the time 
it takes for the voltage to reach its final value at the 
destination. In general, designers try to minimize the 
magnitude of reflections. 


System designers can do several things to reduce or 
minimize reflections: reduce angles (specifically 90° 


angles in traces, minimize the number of vias, and use 
termination when necessary). 


Figure 18 illustrates how angles can be reduced by us- 
ing 135° bends instead of 90° bends. The 135° bend 
approximates a smooth curve more closely than the 90° 
bend. The discontinuity occurs in the 90° bend because 
the trace is wider through the bend and therefore the 
impedance is altered by the geometry of the trace. 


> 
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Figure 18. Eliminate 90° Angles 


Figure 19 illustrates a way to reduce the number of 
discontinuities by minimizing the number of vias. Once 
again, a via causes a change in the path of a signal 
much like the 90° angles do. In addition, the geometry 
of a via is generally wider or thicker than the rest of 
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the trace resulting in a different impedance for that por- 
tion of the interconnect. The change of impedance in 
the path causes the discontinuity and the resulting re- 
flections. 
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Figure 19. Minimize Vias 


It is not always possible to eliminate all the discontinui- Zo = Zterm + ZL 

ties or mismatches in impedance. When this is the case, and 

it is sometimes necessary to use a technique called Ter- Z, = Zins + Ze 

mination to artificially make the mismatched imped- 

ances appear matched. This technique is normally used Several techniques of termination exist. They are Paral- 
to match a traces characteristic impedance with either lel, AC or RC, and Series termination. Each technique 
the load or sources impedance. has its own advantages and disadvantages as summa- 


rized in Table 3. 
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Table 3. Termination Techniques 


Type of Advantages Disadvantages 
Termination 


Parallel 
lo 
joe 
V2 
[ C. 
Rs Zo 
so REE 
R1 | | R2 = Zo 
R2 = 2.6Zo : 
* Ri = R2/1.6 


AC or RC 
As fo 
R ten = ZO | 
J. R tom Crew = t Rise/Fall =o 
| 1 


4.2.2 CROSSTALK 


Crosstalk is another side effect of transmission line in- 
terconnnects. Crosstalk is the result of fields from adja- 
cent traces interacting with each other. The interaction 
can alter the characteristics of a driven line or cause 
noise to be coupled into passive lines. 


Crosstalk can be characterized by two parameters: Mu- 


tual Inductance, L,,, and Mutual Capacitance, C,,, as 
shown in Figure 20 and 21, respectively. These two 
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@ eliminates reflections 
at receiver 

® good overshoot 
suppression 

@ no added delay 


e high power dissipation 

e requires Zo > 1009. 
to avoid exceeding dc 
current limit 

e reduced voltage swing 


Cm 


© low power 
consumption 

© full voltage swing 

@ eliminates initial 
reflection at receiver 


e Cterm adds capacitive 
Load to driver 

e added delay due to 
RC time constant 

R tem ® component size and 

Cus count 

® no additional loading 
on driver 

@ no additional 
charging time 

e low power 
consumption 

© eliminates secondary 
reflection at source 


e added delay 


parameters represent the inductive and capacitive val- 
ues that exist between two adjacent lines. The induc- 
tance allows a current in one line to induce a voltage in 
a second line. 


The capacitance allows a voltage on one line to induce a 
current in the second. 


ImM2 = Cm * A(V1V2)/At 
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These mutual components have an additive effect to the 
L and C used to characterize each transmission line. To 
see what effect this has, examine the two components 
separately. First, Figure 20 illustrates two parallel 


traces with their inductive components and a mutual 
inductance between the two. 


+V1-- 
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Figure 20. Inductive Components of 
Two Parallel Traces 


The voltage seen on each line is given by: 


V1 = Vi + Vm = (L4 * Aly/At) + (Lm * Alp/At) 
Vo = Vio + Vm= (Lo * Alo/At) + (Lm * Aly/At) 


To see what effect this has assume L; = Ly? and the 
magnitude of AI,;/At = the magnitude of AI>/At. This 
allows the above equations to be simplified to: 


Vi = Vo = (Ly + Lp) * Aly/At 
(Current in same direction) 
and 


Vi = —Vo = (Ly — Lp) * Aly/At 
(Current in opposite direction). 


From these equations the effective inductance seen on 
either trace is: 


Lore = Ly + Lm 
(Current in same direction) 


and 


Lon = ba = bm 
(Current in opposite direction). 
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Therefore, if the currents are flowing in the same direc- 
tion the effective inductance of each trace is increased. 
If the currents are in opposite directions the effective 
inductance of each trace is decreased. 


Secondly, Figure 21 illustrates two parallel traces with 
their capacitive components and a mutual capacitance 
between the two. 
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Figure 21. Capacitive Components of 
Two Parallel Traces 


The current seen in each line is given by: 


ly = I|o1 + Iw 
(C1 * AV4/At) + (Cy * A(Vy — Vo) /At) 
((C; + Cr) * AV4/At) — (Cr * AVo /At) 


lo = Ico + Im 
(Co * AVo/At) + (Cm * A(V2 — V4) /At) 
((Co + Cm) * AVo/At) — (Cm * AV, /At) 


Using the same assumptions that C; = C) and that the 
magnitude of AV;/At = the magnitude of AV>/At al- 
lows the equations to be simplified to: 


|, = lo = Cy * AV;/At (Voltage change in the same 
direction on both traces) 


and 


ly = —lo = (Ci + 2C,) * AV;/At (Voltage change in the 
opposite direction on each trace). 
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From these equations the effective capacitance seen on 
either trace is: 


Cefp = C1 (Voltage change in same direction) 
and 


Cott = Cy + 2Cm (Voltage change in opposite 
directions). 


Therefore, if the voltages are changing in the same di- 
rection the effective capacitance of each trace is un- 
changed or decreases. If the voltages are changing in 
opposite directions the effective capacitance of each 
trace is increased. See Figure 22. 


If Leg and Copp are used to determine Zo and Tpd, the 
following results: 


Zo = (Leff/Ceff)1/2 Tpd = (Leff * Ceff)1/2 


Same Direction 


Lett cl Cott a =p zo Me Tpa? 


Opposite Direction 


Ln Con Me Bx Zo Tn? 
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Figure 22. Effect of Changing Voltages in the 
Same or Opposite Directions 


Electrons travel at the speed of light, so Tpd can never 
decrease. Therefore, Tpd either remains constant or in- 
creases. 


This altering of Zo and Tpd by crosstalk explains why 
termination is never 100% effective. The crosstalk leads 
to a variation between the targeted Zo and the actual 
Zo. Termination is usually defined to match the target- 
ed Zo’s. The result is an interconnect that is not per- 
fectly matched via termination. 
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5.0 CHIP SET DESIGN 


As simulation tools have improved it has become easier 
to test design assumptions before actually spending the 
time or money to build a printed circuit board. In addi- 
tion, the frequencies at which signals switch have also | 
increased and complicated the process of designing a 


board that ensures signals reach their destination at the 


correct point in time and maintain a reasonable level of 
signal quality. To better predict signal behavior and 
minimize the need for board rework or revision, many 
designers are simulating their board layouts before 
building a board. The complexity of the optimized in- 
terface of the CPU-Cache Chip Set is a prime candidate 
for this type of approach. In this interface designers 
must ensure that the signals accurately travel along the 
interconnects at very high frequencies (i.e., 66 MHz). 
As discussed, transmission line effects become a more 
dominant influence on signals switching at these fre- 
quencies. It is important to take these effects into ac- 
count to ensure that no specification violations occur. 


A possible scenario for designing the optimized inter- 
face is shown in Figure 23. As always the first step is to 
understand the specifications. This document along 
with the published specifications should help complete 
this step. Based on these specifications, system geome- 
try requirements, and an understanding of the board’s 
basic electrical characteristics, a first pass component 
placement and routing can be completed. Once the 
routing is complete or possibly as part of the routing, 
individual traces should be simulated to determine their 
electrical behavior. This includes examining both flight 
time and signal quality for each signal and determining 
if it meets the specification. Any signals that violate the 
specification must be modified. Portions of this docu- 
ment will provide some information and guidelines on 
how to modify or route the traces to meet the specifica- 
tions. With each change, the routing should be re-simu- 
lated to ensure the specifications are still met. 
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Modify Layout 
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from Simulation 
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Measure Board 
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Figure 23. Process for Completing Optimized Interface Design 


Once all the specifications are met, it is time to build 
the board. The goal is that once this board is manufac- 
tured and the components are installed, it will meet 
specification without any changes. However, this must 
be verified by making actual measurements on the 
board to verify all of the flight time and signal quality 
specifications are met. It is also beneficial to make sure 
that the actual measurements correlate to the predicted 
results from simulation. This is especially helpful if any 
corrections are required to bring the board within spec- 
ification. 


The next couple of sections will describe the require- 
ments and guidelines that should be followed while 
making these simulations and measurements. 


5.1 Simulation Environment 


The environment chosen to simulate the optimized in- 
terface is very critical. A number of different options 
are available on the market today. It is the system de- 
signer’s responsibility to select the option best suited for 
their design requirements. These requirements will in- 
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clude the accuracy of the results; as well as, how easy it 
is to import schematics, layout routing, or modeling 
parameters. 


5.1.1 SIMULATION REQUIREMENTS 


When simulating the optimized interface to determine 
flight time and signal quality, it is important that the 
appropriate modeling parameters are used. The I/O 
models are provided with minimum and maximum val- 
ues for each parameter. Using these values the fast and 
slow corners of the buffer’s behavior can be modeled. 
In addition, the printed circuit board can be modeled 
for its fast and slow corners. Table 4 restates the char- 
acteristics of a printed circuit board. 


Flight time is determined by simulating with the slow 
corner used for all parameters. In this corner signals 
require the longest amount of time to transition and 
reach their destination. The fast corner is used to simu- 
late signal quality. In the fast corner, signals transition 
their fastest and are therefore their noisiest. Table 5 
summarizes the parameter values used to simulate for 
flight time and signal quality. 
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Table 4. Parameters Used to Specify Printed Circuit Board Characteristics 


Parameter 


r-~]| an vane 
Characteristic Impedance | Zo ({) Minimum and maximum impedance for signal traces on each layer 


Propagation Delay S (ns/ft) | Minimum and maximum propagation delay for signal traces on each 
layer 

Via Capacitance Cvia (pF) | Minimum and maximum capacitance of a via used to pass a signal 
from one layer to another of the PC board 


Table 5. Parameter Values Used to Simulate Flight Time and Signal Quality 


Input Buffer 


Printed Circuit Board 


These values should be used to define the simulation 
model files used to simulate for flight time and signal 
quality. 


While simulating the two corners it should become ob- 
vious that there will be trade-offs in optimizing for one 
or the other. Some sacrifices in signal quality may be 
required to ensure flight time specifications are met, or 
vice versa. 
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— Parameter Flight Time Signal Quality 


MEL ES? SACO SAE Bet 
Output Buffer dV/dt Po inate te 


5.2 Routing Signal Traces for Their 
Optimal Performance 


Priority should be given to optimizing the performance 
of the signals in the optimized interface. For the 256K 
byte layout example that Intel completed, the signals 
have been divided into the categories listed in Table 6. 
These categories are based on fanout and connectivity 
characteristics. 
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Table 6. Optimized Interface Signal Categories 


High Address 
(connected to PP and CC) 


PP-CC Control 
(connected to PP and CC) 


PWT, SCYC 


WB/WT # 


PP-CS Control 
(connected to PP and CSs) 


CC-CS Control : 
(connected to CC and CSs) 


PP = Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 


Within each category the routing or topology should be 
defined to minimize delay while maintaining acceptable 
signal quality. To do this and maintain the manufactur- 
ability of the board, rules were defined to govern the 
line lengths for each segment of a topology. To develop 
these rules some analysis of board characteristics and 
signal behavior is necessary. 


5.2.1 RULES FOR OPTIMIZING SIGNAL 
ROUTING 


Both the fast and slow corners must be considered to 
ensure both flight time and signal quality are met by 
optimizing a signal’s routing. 


Flight time is minimized by optimizing each intercon- 
nect to minimize the distance the signal must travel and 
the loading presented to the driver. The dominant op- 
position to minimizing these factors is the printed cir- 
cuit board’s geometry requirements (i.e., physical dis- 
tance between components and component placement) 
and electrical characteristics (propagation delay and 
characteristic impedance). 


The strategy used to optimize each interconnect for sig- 
nal quality is to make each net’s routing electrically 
symmetric. This is especially important on heavily 
loaded nets. 
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Signal 


Low Address a A3-A16, HITM#, W/R# 
(connected to PP, CC, and CS) 
A17-31, BTO-3 


Driven by PP: ADSC#, AP, CACHE #, D/C#, LOCK#, M/IO#, PCD, 


Driven by CC: AHOLD, BRDYC1 #, EADS#, INV, KEN#, NA#, 


BLAST #, BRDYC2#, BUS#, MAWEA#, MCYC#, WBA, WBTYP, 
WBWE #, WRARR #, WAY 


Other CC Control BLEC#, BOFF# 
Byte Enables | BEO # -BE7 # 
CPU Data and Parity CDO-CD63, CPO-—CP7 


Electrically symmetric means the delays of each branch 
within the net are equal when viewed from the driver. 
Figure 24 shows a topology from the 256 Kbyte layout 
example that illustrates this principle. For this topology 
with the Pentium processor driving, the symmetry is 
best when the delay from the Pentium processor to the 
82496 cache controller is equal to the delay from the 
Pentium processor to the farthest 82491 cache SRAM. 
By making these delays equal, the round-trip delays are 
also equal, and therefore any reflections return to the 
Pentium processor simultaneously. By returning simul- 
taneously, the reflections can rapidly cancel each other, 
resulting in the waveform settling quickly. 


If these two delays are not equal, asymmetric reflec- 
tions return to the Pentium processor at different times, 
and do not cancel each other. The result is a complex 
interference pattern that generates considerable ringing. 
In some cases this ringing can last for more than one 
clock cycle. — 


5.2.2 DETERMING THE OPTIMAL NET 


There are two methods of optimizing the line lengths 
and relationships of traces within a net. One uses an 
asymmetry factor [5] to identify the optimal relation- 
ship. The other uses settling time to find this relation- 
ship. 
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Using the asymmetry factor to optimize the symmetry 
of the net in Figure 24, the input impedance (frequency- 
domain) of each branch was calculated from the driver’s 
point of view. The branch impedances were compared to 
each other and the difference was integrated over all 


82491#2 
cd 08-15 


82491#9 82491#6 
(parity) cd 40-47 
0-3 


82491#10 82491#4 
(parity) cd 24-31 


Ay 


82491#8 
cd 56-63 


a 

intel ° 
frequencies, resulting in a symmetry energy factor 
which quantifies the amount of symmetry in the topolo- 


gy. Figure 25 also shows a plot of this factor as a func- 
tion of the lengths of segments L__a and L__b. 


82491#1 
cd 00-07 


82491#5 | Pentium(tm) 
cd 32-39 Processor 
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cd 16-23 


82491#7 
cd 48-55 
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Figure 24. Energy Minimization for a Given Topology 
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Figure 25. Energy Minimization for a Given Topology 


There is a strong correlation between the asymmetry 
factor and the net’s signal quality. This is reinforced by 
simulating the topology using values for L__a and L__b 
from the plot in Figure 25. Figure 26 shows the wave- 
forms obtained by simulating the topology with sym- 
metric values for L__a and L__b from along the energy 
minimum. The line of points in the plot of Figure 25 
where the energy minimum occurs corresponds to to- 


: PRELIMINARY 


pologies that are electrically symmetric. An asymmet- 
ric topology is obtained by using values for L__a and 
L__b that lie away from the energy minimum. The 
waveform obtained by simulating this asymmetric case 
is also shown in Figure 26. Notice the difference in 
signal quality in the two plots. The symmetric case is 
much better than the asymmetric. 
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Symmetric (La = 1.8”, Lb = 5.9”) 
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Asymmetric (La = 1.8”, Lb = 4.0”) 
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Figure 26. Symmetric Versus Asymmetric Values for La and Lb 
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This technique can be used to optimize the routing of 
all heavily loaded signals in a chip set design. From the 
energy factor plot rules can be defined to govern the 
segment lengths needed to minimize the energy factor 
and obtain the specified signal quality. 


The 256K layout example that follows used this tech- 
nique extensively to route the heavily loaded signals. 
For each signal group or topology, the asymmetry ener- 
gy factor was calculated as a function of the topology’s 
segment lengths and a set of rules defined to govern the 
segment lengths required to provide a routing that 
meets the signal quality specifications. 
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Similar results can be determined by using settling time 
to the optimal routing. To optimize the symmetry of a 
net, the settling time is plotted against line length. The 
minimum settling time occurs at the point where the 
net is balanced. Figure 27 shows a settling time plot for 
the net in Figure 24. Settling time is plotted against the 
true length for the segment between the Pentium proc- 
essor and the 82496 for a given length between the Pen- 
tium processor and 82491. For La= 1.8 inches the set- 
tling time approach recommends Lp = 5.8-—6.0 inches. 
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Figure 27. Settling Time Versus Line Length 


5.2.3 SERPENTINE STRUCTURES 


Serpentine structures are one design technique that can 
be used to assist in balancing the interconnect delays 
between the Pentium processor, 82496 cache controller, 
and 82491 cache SRAM components. The structure is 
used to add length to specific traces within the nets. 
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The goal of adding this length is to make the net “bal- 
anced” or electrically symmetrical. In particular, this 
technique has been used to add length to the trace be- 
tween the Pentium processor and 82496 cache control- 
ler so that it is electrically symmetric with the traces 
between the Pentium processor and 82491s of the same 
net as shown in Figure 28. 


3-225 


AP-481 


82491#9 


(parity) 
0-3 


82491#10 


(parity) 
4-7 


82491#2 
cd 08-15 


82491#6 
cd 40-47 


82491#4 
cd 24-31 


82491#8 
cd 56-63 


82491#1 
cd 00-07 


82491#5 
cd 32-39 


82491#3 
cd 16-23 


82491#7 
cd 48-55 


Pentium(tm) 
Processor 


241576-35 


Figure 28. Balancing Nets Using Serpentine Structures 


Due to the parallel traces that make up the serpentine, 
cross-coupling may occur between the individual por- 
tions of the serpentine. The cross-coupling may cause 
the propagation velocity and characteristic impedance 
of the serpentine to differ from those of a straight line 
of equivalent length. In general, the propagation veloci- 
ty may be greater and the characteristic impedance less 
for the serpentine structure. To simplify the simulation 
environment for the CPU-cache-chip set design exam- 
ple, the added trace length was assumed to be equal to 
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the length of trace used to make the serpentine. See 
Figure 29. 


Experiments were performed to confirm that this as- 
sumption was valid. The experiments involved Time 
Domain Reflectometry (TDR) and Time Domain 
Transmission (TDT) measurements on various serpen- 
tine configurations. The height of the serpentine, h, and 
the separation, s, were varied. 
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Figure 29. Parameters of a Serpentine Structure 


It was found that as the separation was increased or the 
height decreased the propagation velocity increased. 
Amount of increase varied from being almost negligible 
to being approximately 40% for short, closely spaced 
serpentines. Also, it was observed that as the height 
decreases the magnitude of the decrease in impedance 
gets smaller, with the largest decrease in impedance, 
approximately 12%, seen when the height and separa- 
tion are at their smallest. The serpentines used in the 
CPU-cache-chip set design example were not the worst 
case configuration. Based on the experiments, the ser- 
pentines should cause less than 10% decrease in the 
characteristic impedance and less than 30% decrease in 
the propagation delay. 


Both of these variations appear considerable at first 
glance. However, serpentines, as used in the CPU- 
cache-chip set design example, account for only a small 
percentage of the entire trace length of a net. For exam- 
ple, if the serpentine is only 25% of the total trace 
length and the total propagation delay is 2 ns, repre- 
senting the trace as a straight line length only intro- 
duces a maximum error of about 150 ps. This is deter- 
mined by assuming the 25% of trace accounts for 
0.5 ns delay and a 30% decrease is 150 ps. Based on the 
small amount of error introduced, it was decided that a 
straight line representation was accurate enough and 
simplified the simulations. 
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Note the variation in effects caused by the serpentines. 
Each design should perform similar analysis if a differ- 
ent serpentine structure than that used in the CPU- 
cache-chip set design example is used. 


If the designer chooses to include the propagation ve- 
locity and characteristic impedance variations in the 
simulations, the only change is to represent the serpen- 
tine length of trace as a separate length with the differ- 
ent characteristics. 


6.0 EXAMPLE: DESIGNING THE A12 
Aas FOR THE CPU-CACHE CHIP 


The Al2 net is one of the more complex nets in the 
optimized interface of the CPU-Cache Chip Set. The 
signal is driven by the Pentium processor to the 82496 
cache controller and all the 82491 cache SRAMs dur- 
ing memory reads. In addition, the 82496 cache con- 
troller drives this signal to the Pentium processor 
during inquire cycles. 


In routing A12 net all of the guidelines and techniques 
described were used. An initial routing of the A12 net 
was made using an H-type routing and attempting to 
make all of the interconnects as short as possible. The 
resulting topology is shown in Figure 30. 
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Figure 30. Initial Topology for A12 Net 


The Al12 net was simulated assuming the Pentium 
processor is driving the net. Quad Design’s TLC was 
used to simulate the A12 net. For flight time the slow 
corner is used. The slow corner uses the model parame- 
ters as defined in Table 5. The actual values can be 
obtained from the Pentium™ Processor User’s Manual. 
A TLC control file calls the appropriate model and 
topology files along with setting the needed measure- 
ment points to complete the flight time simulations. 


3-228 


Initially, the line lengths or segments between compo- 
nents was assumed to be the straight line distance. In 
other words, the initial routing conserved space and 
used the shortest line possible to connect the compo- 
nents. The rising and falling waveforms resulting from 
the TLC simulations of this routing are shown in Fig- 
ure 31. 
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Figure 31. TLC Simulation of A12 Net Using Straight Line Lengths 
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Notice the large amount of ringing that occurs in this 
routing of the net. The excessive ringing can cause fail- 
ures in both the signal quality and flight specifications. 
To bring the net within specification the routing must 
be improved to better “balance” the net. At first glance, 
the loading on the 82496 cache controller branch is 
much less than the 82491 cache SRAM branch. Split- 
ting the 82491 cache SRAM branch into two branches 
and continuing the H-type routing along those branch- 
es and lengthening the 82496 cache controller branch 


] 
intel ° 
should improve the “balance.” The asymmetry energy 
factor, described in Section 5.2, was used to derive the 


relationship between individual trace segments needed 
to balance the net. The relationship is that: 


Lb = 2* La + Leo + 800 mils; La < 2000 mils 
1.6 * La + Le + 1500 mils ; La > 2000 mils 


Following these rules ensures that the A12 net is elec- 
trically symmetric. Figure 32 illustrates this improved 
routing. 
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Figure 32. Improved Topology of A12 Net 


The improved net was also simulated using TLC and 
the resulting waveforms are shown in Figure 33 
through Figure 35. Notice the reduction in ringing. 
With the “balanced” or electrically symmetric routing 
the net exhibits better flight time and signal quality. In 
fact these parameters are now within specification as 
summarized in Table 7. 


The technique for measuring flight time and signal 
quality are described in detail in Section 2.0. To mea- 
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sure flight time one must first determine the 50% point 
of the unloaded Pentium processor driver as shown in 
Figure 33. In the example this occurs at 2.26 ns. Next 
one must determine where the waveform crosses the 
50% Vcc point at the receiver as shown in Figure 34. 
This crossing occurs at 4.54 ns. The time difference 
between these two points is the 50-50 flight time. The 
50-50 flight time for the A12 net example is 2.28 ns. 
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a 

intel. 

Flight time also assumes the waveform continues 
through the 50% Vcc point with a slope of at least 
1 V/ns through the 65% Vcc point. To ensure this, first 
determine the 65% point on the A12 flight time simula- 
tion as shown in Figure 35. The 65% Vcc point is 
5.32 ns. Next extrapolate using the 1V/ns line to find 
where it crosses the 50% voltage level by subtracting 
0.68 ns from the 65% Vcc number. The extrapolated 
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50% Vcc point for the example is 4.64 ns. The time 
difference between the unloaded buffer’s 50% point 
and the extrapolated crossing of the 50% point is the 
50-65 flight time. The 50-65 flight time is 2.38 ns. 


The greater of the 50-50 and 50-65 flight times is the 
flight time for the net. In this case, the flight time is 
2.38 ns, the 50-65 flight time. ; 
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Figure 33. Measuring the 50% Vcc Point of the Unloaded Output 
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Figure 34. Measuring the 50% Vcc Point at the A12 Input of the 82496 
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Figure 35. Measuring the 65% Vcc Point at the A12 Input of the 82496 
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Figure 36 and 37 illustrate the measurement of the sig- measured between points A and B. Ringback is the 
nal quality parameters for the A12 net. Overshoot is the maximum voltage amount that the signal cross back 
maximum voltage above Vcc. Time beyond supply is across Vcc. Settling time is measured from point C and D. 
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Figure 36. Measuring the 50% Vcc Point of the Unioaded Output at the Fast Corner 
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Figure 37. Measuring the Signal Quality of the 82496 


The flight time and signal quality specifications for this 
net are listed in Table 7 along with the values measured 
in the simulation of the net. 


Table 7. Flight Time and Signal Quality Simulated Values 


7.0 256K CPU-CACHE CHIP SET 
OPTIMIZED INTERFACE LAYOUT 
DESIGN EXAMPLE 


This chapter contains an example layout design for 
Intel’s 256 Kbyte CPU-Cache Chip Set’s optimized in- 
terface. Intel has simulated and verified the example 
layout using the latest information. Work is currently 
underway to validate the design by measuring the flight 
time and signal quality parameters on boards based on 
the design example. As updated information becomes 
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[-Sianat | FightTime | Overshoot | Ringback | Setting Time 
[Spee [TG | Spec | 16 | Spee | TO | Spee | lo 
aera | aov | +eev | a5m veo | os7v | iasne | a76ne_ 


available on the components and the boards, Intel plans 
to update this example accordingly. 


The intent of the design example is to provide system 
designers a starting point. It provides one solution of 
how the Pentium processor, 82496 cache controller, 
and 82491 cache SRAM components can be placed and 
routed to ensure flight time and signal quality specifica- 
tions are met. It is not the only solution. System design- 
ers can alter the layout to meet their system require- 
ments as long as the flight time and signal quality speci- 
fications are met. 
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7.1 Layout Objectives 


The 256K layout is an example of a CPU-Cache chip 
set arrangement that meets Intel’s chip set specifica- 
tions. The layout consists of 1 Pentium processor, 1 
82496 cache controller, and 10 82491 cache SRAMs for 
a 256K second-level cache with parity. Although the 
layout is specifically designed for a chip set with parity, 
we will also discuss conversion to a non-parity layout. 


This example layout follows the chip set’s flight time 
and signal quality specifications. In addition to meeting 
those specifications, we had the following objectives: 


1. To design the optimized portion of the interface so 
that the layout is not limited by interconnect per- 
formance. By not artificially creating any critical 
paths, the interface can yield maximum performance 
of the chip set. 


2. To be consistent with EMI and thermal require- 
ments. 


3. To have the layout be used as a validation and cor- 
rection vehicle by Intel. Intel will use the layout to 
validate the optimized interface of the chip set, mea- 
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sure flight times and signal quality, and tune input 
and output buffers. 


Provided are complete specifications for a board layout: 
part lists, board layer plots, and the electronic files in 
Gerber format. Also provided are a set of topologies 
and line lengths so it will be easy to understand how the 
layout was generated. 


7.2 Component Placement 


To meet flight time with clock skew restrictions we 
placed the parts in relative proximity to each other. At 
the same time, we ensured that the layout’s Memory 
Bus Controller (MBC) interface signals are routable. 
Figure 38 illustrates how the chip set components are 
placed in the layout example. The dot indicates the lo- 
cation of pin 1. Figure 38 component side view of the 
layout. 
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Figure 38. Component Placement 


7.3 Signal Routing/Topologies 


Tables 8 and 9 list the signal nets and their correspond- 
ing topologies for the optimized and external interfaces 
of the CPU-Cache Chip Set. | 


All chip set signals in the optimized interface fall into 
six groups: low addresses, high addresses, Pentium 
processor control, 82496 control, CPU data, and byte 
enables. Within each group are subsets of signals that 
share common origination and destination points. Each 
subset has a unique routing called a “topology.” 
Groups, subsets, and topologies are listed in Table 8. 


Topologies are given only for signals that are routed to 
multiple chips. It is the system designer’s responsibility 
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for routing the “point-to-point” signals such as 
CADS#. 


Topologies are also supplied for the external interface. 
These topolgies provide channels for routing signals 
from the chip set components to the periphery where 
they can be connected to the memory bus and memory 
bus controller (MBC). However, topologies are not 
supplied for point to point signals in the MBC interface 
(e.g. CRDY #). Instead, the system designer must opti- 
mize these for the particular application. 


Table 9 lists the topologies provided for the MBC inter- 
face signals which are not point to point. 
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Table 8. Optimized Interface Signal Net/Topology Assignments 


Routing Requirements 


Low Addresses 
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Topology 


Bused to all core components. Must be 
routed to optimize delay and signal quality at 
all points. : 


(PA3-—PA16) 


High Addresses 


(PA17-PA31, PBTO-—PBT3) Point to point links. Must be kept as short as 
possible. 


Pentium™ Processor Control 


(HITM#, W/R#) Same as low addresses. 


(ADSC #, AP, CACHE#, D/C#, LOCK#, Same as high addresses. 
M/lIO#, PCD, PWT, SCYC) : 


(BRDYC2#, WRARR#, MCYC#, WAY, 
BUS #, MAWEA#, WBWE#, WBTYP#, 
WBA, BLAST #) 


(BLEC#) Not routed to parity CSs. 


(AHOLD, EADS #, KEN#, BRDYC1#, INV, Same as high addresses. 
EWBE #, NA#, WB/WT#) 
CPU Data 
(CD0-CD63) Point to point signals. Keep as short as 


> 
S, 
a 
Mar 


Must be routed to optimize delay and signal 
quality at the CS. 


possible. Keep the total length of each trace 
within 1/2” of each other to minimize skew. 


Byte Enables 


Table 9. External Interface Signal Net/Topology Assignments 


RESETC50, CRDYO # 


(CBEO # -CBE7 #) 


MBRDY #, MOCLK, MDOE # 12, 13 
MFRZ#, MSEL#, MZBT#, MCLK 
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Figures 39 through Figure 58 are the topologies which ed. A topology shows the components that share a spe- 
are described in Tables 8 and 9. A topology is a graphi- cific signal and the relative lengths of the traces be- 
cal representation how specific sets of signals are rout- tween components. 
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Figure 39. Topology 1 
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Figure 40. Topology 1b 
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Routing Topology 3 
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Figure 41. Topology 3 
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Figure 42. Topology 3a 
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Figure 43. Topology 3b 
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Figure 44. Topology 4 
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Figure 45. Topology 5 
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Figure 46. Topology 10 
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Figure 47. Topology 11 
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Figure 48. Topology 12 
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Figure 49. Topology 13 
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Figure 50. Topology 14 
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Figure 51. Topology 15 
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Routing Topology 16 
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Figures 56 and 57 provide topologies for the non-parity Refer to Section 7.7.1 for more details on the non-pari- 
configuration of the 256 Kbyte CPU-Cache Chip Set. ty configuration. 
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Table 10 lists the minimum and maximum trace char- 
acteristics. These parameters along with the board ma- 
terial determine the spacing between layers and the to- 
tal board thickness. See Table 11. 


7.4 Board/Trace Properties 


Specific board and trace properties were assumed while 
performing the simulations to optimize the chip set lay- 
out. These properties were used as the specification or 
guideline the board manufacturer was to use in building 
boards. Figure 58 provides the board layer stackup. 


Only the inner layers of the board are impedance con- 
trolled. The top and bottom layers are not impedance 
controlled. 
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Figure 58. Board Layer Stackup 


Table 10. Trace Characteristics 


_ Table 11. Other Printed Circuit Board Geometries 
ae DCA CRT sce aie Sy Sa 
a oe me 
oe oe ee 
ACS al AES 


7.5 Design Notes 


2. All fast-switching signals are routed near the power 
and ground planes on inner layers of the board to 
minimize EMI effects. However, two sets of signals 
are routed on the top layer of the _ board: 
BRDYC1#, and JTAG signals. BRDYC1 # is rout- 


The following design notes accompany this layout ex- 
ample: 


1. The layout did not specifically address heat dissipa- 


tion except to allow space for heat sinks to be at- 
tached. Please see the Pentium™ Processor User’s 
Manual for the devices’ thermal specifications. The 
Pentium Processor Thermal Design Guidelines appli- 
cation note provides some examples of possible ther- 
mal solutions. 
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ed on top to take advantage of the higher trace veloc- 
ity there. JTAG signals are routed on the top layer 
because they are low-speed signals and will probably 
be re-routed by each customer to suit individual 
needs. 
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3. Resistor R1 (0) is used to set the Pentium processor 
configurable output buffers (A3-—A20, ADS#, 
W/R#, and HITM#). When the resistor is included 
the buffers are set to the Extra Large size. When it is 
not included (BUSCHK # internally pulled high) the 
buffers are set to Large size. Intel currently recom- 
mends the large buffers be used for the 256K layout 
example. The O21) resistor should be designed into 
your design as Intel may change the recommended 
buffer size once silicon and the system design have 
been characterized. 

4. The 82496 output buffers that drive the 82491 inputs 
must also be configured to be Large. This is done by 
driving 82496 CLDRV[BGT #] (pin N04) high dur- 
ing reset. 82496 and 82491 Memory Bus buffer sizes 
must be controlled by the Memory Bus Controller. 

5. Series termination resistors were added to the nets 
PA17, PA18, PA19, and PA20 to control overshoot. 
A value of 242 is currently recommended, but that 
value may change when overshoot is measured on an 
actual board. 


7.6 Explanation of Information 
Provided 


The following sections outline the design files associat- 
ed with the 256Kbyte CPU-Cache Chip Set design ex- 
ample that are available from Intel. These files are pro- 
vided to simplify the task of porting the design example 
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into a specific design. By using these files, designers 
may eliminate or minimize the amount of duplicate ef- 
fort when using the design example as the basis for 
their design. The following items are available: 


@ Schematics 

e I/O Model Files 

e Board Files 

e Bill of Materials 

® Photoplot Log 

® Netlist Report 

e Placed Component Report 

e Artwork for Each Board Layer 
e Trace Segment Line Lengths 


Hard copies of the schematics and trace segment line 
lengths are provided in the following sections. ASCII 
or soft copies of all the information are available from 
Intel by requesting order number 241663, AP-481 De- 
sign Diskettes. 


7.6.1 SCHEMATICS 


Schematics for the 256 Kbyte CPU-Cache Chip Set de- 
sign example were created using ViewLogics’s Work- 
view V4.1. The schematics are 14 pages long. Both the 
Workview and the postscript files are available from 
Intel as described above. 
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7.6.2 1/0 MODEL FILES 


All electrical I/O simulations were performed using 
TLC V4.1.13 from Quad Design Technology, Inc. The 
simulations were performed at the fast and slow corners 
to verify all signal quality and flight time specifications 
are met. The files used for these simulations are avail- 
able from Intel as described above. These files include 
the topology, model, and control files needed to run the 
simulations for all nets in the optimized interface. 


7.6.3 BOARD FILES 


The board files for the design example were created 
using Allegro V4.2 from Cadence Design Systems, Inc. 
The files are available from Intel as described above. 
These files may be used to import the design example 
into a specific system design. Note: some changes to the 
layout and nets may be necessary to complete import- 
ing these files into a specific system design. 


7.6.4 BILL OF MATERIALS 


The bill of materials file was created using Allegro V4.2 
from Cadence Design Systems, Inc. The file is available 
from Intel as described above. 


7.6.5 PHOTOPLOT LOG 
The photoplot log file was created using Allegro V4.2 


from Cadence Design Systems, Inc. The file is available 
from Intel as described above. 
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7.6.6 NETLIST REPORT 


The netlist report was created using Allegro V4.2 from 
Cadence Design Systems, Inc. The file is available from 
Intel as described above. 


7.6.7 PLACED COMPONENT REPORT 


The placed component report was created using Alle- 
gro V4.2 from Cadence Design Systems, Inc. The file is 
available from Intel as described above. 


7.6.8 ARTWORK FOR EACH BOARD LAYER 


The artwork for the six board layers were created using 
Allegro V4.2 from Cadence Design Systems, Inc. The 
files are available from Intel in a Gerber format as de- 
scribed above. 


7.6.9 TRACE SEGMENT LINE LENGTHS 


Sections 7.6.9.1 to 7.6.9.10 list the segment line lengths 
for each net of the optimized interface. All lengths are 
provide in mils (1/1000 inch). The stubs listed in the 
following tables are associated with the pin escapes re- 
quired for the 82491s. 
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7.6.9.1 Low Addresses (Topology 1) 


Routing Topology 1 
iz 


82491#2 82491#1 Critical branch points for 
cd 08-15 cd 00-07 non-parity layout 


82491#9 8249146 8249185 Pentium(tm) 
(parity) cd 40-47 cd 32-39 Processor 
0-3 


82491#10 8249184 8249183 
(parity) cd 24-31 cd 16-23 
4-7 


8249188 8249187 
cd 56-63 cd 48-55 


La <= 2.0°: Lb = 2La+lv+0.8" +-0.1° 
La > 2.0": Lb = 1.6la+lv+1.5° +-0.1° 


241576-80 
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ne | 1401 | 1540 [si202| r1s20 | 006s 
Trav | teste | 6524 | soere| 11s20 | ses | 11565 

oes 

ses 

+ [8885 

3065 

3065 


oat 


rears iva | 17465 [sas7o] 12060 | e065 | 10005 
rrata| 1506 | 10031 [saoae| rove | eves | 1015 | 0085 
rrats| ier | 17926 [sso04| roe | ces | 1015 | e085 
rrate| 1626 | 1619 [seso.1| 1as80 | ces | 1205 | o08s 


PP = Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 


PP =Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 
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7.6.9.2 High Addresses (Topology 4, Point-to-Point) | 


Routing Topology 4 


Pentium(tm) 
Processor 


phage 


PA17* 


*NOTE: 

240. resistor included on PA[17-20]. 

Lengths are Pentium processor-resistor + resistor-82486, respectively. 
PP = Pentium processor 

CC = 82496 cache controller \ 
CS = 82491 cache SRAM 


PRELIMINARY 3-275 


AP-481 | 3 : ntel 3 


7.6.9.3 Pentium™ Processor Control (Topology 1) 


Routing Topology 1 
iL2 ; 


82491#2 8249181 Critical branch points for 
cd 08-15 cd 00-07 non-parity layout 


82491#9 8249186 8249185 Pentium(tm) 
(parity) cd 40-47 cd 32-39 Processor 
0-3 


82491#10 8249184 8249183 
(parity) cd 24-31 cd 16-23 
4-7 


8249188 8249187 
cd 56-63 cd 48-55 


La <= 2.0°: Lb = 2La+lLlv+0.8° +- 0.1" 
La > 2.0°: Lob = 1.6la + lv +1.5° +-0.1° 
241576-82 


"NET [pp-c805 [pp-c8¥a | pr-cc |CS#s-c8e1 | 651-082] 0S#5-c546 | CS¥6-csv0 
famwa| 2052 | ais07 [oars] tai | oes | 140s | e065 
wre | 40011 | aose2 [ore | 11000 | e065 T2265 


PP =Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 
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7.6.9.4 Pentium™ Processor Control (Topology 3b No 82496) 


Routing Topology 3b 
Lz 


82491#2 8249181 
cd 08-15 cd 00-07 


82491#9 82491#6 82491#5 Pentium(tm) 
(parity) cd 40-47 cd 32-39 Processor 
0-3 


Lh 


82491#10 82491#4 8249183 
(parity) cd 24-31 cd 16-23 
4-7 


82491#8 8249147 
cd 56-63 cd 48-55 


For non-parity layout, Lg = Lh 
241576-83 


PP =Pentium processor 
CC = 82496 cache controller 
CS = 82494 cache SRAM 
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7.6.9.5 82496 Control (Topology 3) 


Routing Topology 3 
Lz 


82491#2 
cd 08-15 


8249141 
cd 00-07 


8249149 8249186 
(parity) cd 40-47 


82491845 Pentium(tm) 
cd 32-39 Processor 
0-3 


Lz 


82491#10 8249184 
(parity) cd 24-31 
4-7 


8249183 
cd 16-23 


8249188 
cd 56-63 


82491#7 
cd 48-55 


Lh=Llg+1.4" +-0.1° For non-parity layout, Lg = Lh 
241576-84 


cc-cs#1 | cc-cs#5 | cc-cs#3 | CC-cs#7 | CS#1-cS#2 | CS#5-CS#6 


ET 

reuse | «607 | azrs7 | eato0 | aera | ones 

rMawea | ea7e4 | s0680 | cous | asta | ses | e065 
Pwcyce | «o10 | s6070 | sane | sears | ones 

rwea | e112 | ara7e | eros | sttea | ses | s00s 
rwerve | asesa [20002 | 20735 | aa7e0 | soes | e005 
rwawer | as0ea | atoar | rao | asti7 | snes | e085 
rwaanne | ste | 27050 | e021 | atone | e085 | e607 
rway | asi | soa | cera | aetea | ones 
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BRDYC2# (cont) 
BUS # 
(cont) 


MAWEA # (cont) 


MCYC# 
(cont) 


WBA 
(cont) 
WBTYP 
(cont) 
WBWE # 
(cont) 
WRARR # 
(cont) 
WAY 
(cont) 


PP = Pentium processor 
CC = 82496 cache controller 
CS=82491 cache SRAM 
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85.0-—85.0 


85.0-—85.0 
75.0-—75.0 


135.3-135.3 
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7.6.9.6 82496 Control (Topology 3a Not Connected to Parity 82491’s) 


Routing Topology 3a 
Lz 


82491#2 8249141 
cd 08-15 cd 00-07 


8249189 8249186 82491485 Pentium(tm) 
(parity) cd 40-47 cd 32-39 Processor 
0-3 


82491#10 8249184 82491#3 
(parity) cd 24-31 cd 16-23 
4-7 


82491#8 82491#7 
cd 56-63 cd 48-55 


Lh=lg+1.4° +- 0.1" For non-parity layout, Lg = Lh 
241576-85 


cc-cs#1 | cc-cs#5 | cc-cs#3 | cc-cs#7 | cS#1-cS#2 | CS#5-CS#6 
BLEC# | 4222.7 4219.0 4205.8 4206.4 9365 | 9365 | 


NET | cs#6-cs¥9 | cs#3-cs#4 | cS#4-cs¥10 | CS#7-cs#8 | Stubs | 
n/a 


4 


PP =Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 
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7.6.9.7 82496 Control (Topology 1b Pentium™ Processor and 82496 Switch Positions) 


Routing Topology 1b 


82491489 
(parity) 
0-3 


82491#10 


82491#2 
cd 08-15 


8249146 
cd 40-47 


8249184 


Lz 


8249181 


cd 00-07 


8249145 
cd 32-39 


8249143 
cd 16-23 


Critical branch points 
for non-parity layout 


Pentium(tm) 
Processor 


(parity) cd 24-31 
4-7 


8249148 82491#7 
cd 56-63 cd 48-55 


241576-86 


BOFF # 936.5 944.8 936.5 75.0-75.0 
(cont) 


PP = Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 
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7.6.9.8 82496 Control (Topology 4, Point-to-Point) 


Routing Topology 4 


Pentium(tm) 
Processor 


241576-87 


PP = Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 


7.6.9.9 Byte Enables (Topology 5) 


Routing Topology 5 


Pentium(tm 
Processor 


241576-88 
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Tee-CS#1 Tee-cs#9_ | Stubs 
CBEO# 3035.4 1634.4 1633.9 112.4 135.3 


Tee-CS#2 Tee-cs#9_ | Stubs 
1294.8 1293.1 75.0-121.0 


Tee-CS#4 Tee-CS#9 
CBE3# 3547.8 1194.8 1192.1 


Tee-CS#5 Tee-cs#10_ | Stubs 
CBE4# 3600.9 2243.5 2242.0 75.0-135.3 


NET Tee-CS#7 Tee-CS#10 
CBE6# 4662.0 1663.0 1662.6 


Tee-CS#8 veecowse | [> shite red 
1167.7 1165.5 75.0-75.0 


PP =Pentium processor ,; 
CC = 82496 cache controller 
CS = 82491 cache SRAM 


7.6.9.10 CDATA and Parity (Point-to-Point) 


Routing Topology 4 


Pentium(tm) 
Processor 


241576-89 
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Cnet | pp-cse0 | Sub 
oro | sreaa | 1050 
opr | seaar | 1053 
ope | 50647 | 750 
Tore | sriaa_| 750 


PP-CS#2 


[stub 
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5541.4 135.3 


ner | pp-csea | sub | net | pr-csve 
coe | seae« | 1050 | coo | seoas | 1063 | como | 52000 
750 


1253 | cow | seas 
750 | cos | 5756.1 | 750 
1955 


ner | ppcosei0 | stub 
Tors | ses8 | 105.0 
Tore | ssen8 | 750 


ss 
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ner | pp-cse7 | Sub 


ce 


PP = Pentium processor 
CC = 82496 cache controller 
CS =82491 cache SRAM 


7.6.10 Pentium™ PROCESSOR TO 82496 
SEGMENT LENGTH AND ROUTING 
CHANGES 


The example layout described in this application note 
was completed using early revisions to the I/O buffer 
models. This process was necessary to ensure that a 
board was available for the arrival of first silicon. After 
the models were improved based on the model valida- 
tion and silicon characterization, the board layout was 
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Cnet | pp-csee | sub 
Toose | ses00 | 1053 
cos7 | 5000 | 1053 
cost | _se0a0 | 750 


resimulated. These simulations have resulted in the rec- 
ommendation to change the line length between the 
Pentium processor and 82496 for several nets. These 
changes result in a better tuned routing that meets the 
specifications. In particular, these changes reduce the 
amount of ringback and the ringing that leads to long 
settling times. Table 12 summarizes the recommended 
segment length changes. 


Table 12. Summary of Segment Lengths 


Segment 


Net/Signal Name 
WRARR # 
WRARR # 


PA12 
PA16 


PP = Pentium processor 
CC = 82496 cache controller 
CS = 82491 cache SRAM 


Actual system measurements have shown that the orig- 
inal segment lengths do not violate the specifications. 
The reduction in ringing is probably due to transmis- 
sion line losses which are not accounted for in the simu- 
lation. Therefore for completed designs using the exam- 
ple layout these changes are not necessary; however, 
Intel does recommend that all future designs that use 
the layout example use these new lengths. 
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Original Length (in.) Recommended Length (in.) 


In addition, a layer change to the BRDYC1# routing 
is recommended. Section 7.5 Design Notes, describes 
that BRDYC1# was routed on an outer layer to reduce 
the propagation delay; however, this resulted in a signal 
quality violation. Since the original routing of the 
board, the flight time specification was relaxed and 
BRDYCI ¥# can now be routed on an inner layer which 
allows it to meet both signal quality and flight time 
specifications. 
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7.6.11 1/O SIMULATION RESULTS FOR EACH 
NET 


Electrical simulations were performed on each net 
within the optimized interface of the 256 Kbyte CPU- 
Cache Chip Set design example. The simulations were 
done at the fast and slow corners to verify that signal 
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hed 


quality and flight time specifications are met. The simu- 
lations were done using TLC V4.1.13 from Quad De- 
sign Technology, Inc. using the files described in Sec- 
tion 7.6.2. Table 13 summarizes the simulation results 
assuming all the segment length changes listed in Sec- 
tion 7.6.10 have been implemented along with the layer 
change to the BRDYC1 # routing. 
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* 
. 
« 


Table 13. Summary of Simulation Results 


Flight Time 
(ns) 


Over/ 
Undershoot (V) 


Signal Quality 


Ringback 
(V) 


1.75 


82496 Driving 


Settling Time 


(ns) 


AP-481 


Time Beyond 


Supply 
(ns) 


A3-16 7.2 
at CPU 


A3-16 
at SRAM 


A17-31 
at CPU 


BTO-3 
AHOLD 


” 
oO 


acaili ‘ 
ti 


— 
BAS 


BRDYC1 # 
EADS# 
EWBE # 
INV 

KEN# 1.0 
NA# 
WB/WT# 
BLAST# 
BLEC# 


BOFF # 
at CPU 


BOFF # 
at SRAM 


BRDYC2# 


eYERRF RF URITERERE 
U 

2 

~J 
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2.1 
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1.75 
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ae ee 
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~J 


N 
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are 
NN 


axl 
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es 
oe 
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N 
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12.1 


okt 
©O 
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Oo 
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N 
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Table 13. Summary of Simulation Results (Continued) 


Signal Quality 
Flight Time 


(ns) Over/ Ringback Settling Time 


Undershoot (V) (V) (ns) 


nm 
_" 


cca 
Oo 


Oo 
£ 


i) 
N 
fo) 
N 


Nm 

‘ails 
ok a 
oO oO 


ok 
as 


Pentium™ Processor Driving 


A3-16 
at Controller 


A3-16 
at SRAM 


A17-31 . 


= 
NO 


BTO-3 


DO-63, DPO-7 


; 


HITM# 
at Controller 


=" 
1°») 


W/R# 
at Controller 


W/R# 
at SRAM 
ADSC # 


aaa 
o>) 


—_— 
hk 


ST 
ie) 


CACHE # 


LOCK # 
M/AO# 


—h 
ek 


Time Beyond 


Supply 
(ns) 


Nm | — 
—i1N 


i 
™N 
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Table 13. Summary of Simulation Results (Continued) 


Over/ 


Undershoot (V) 


Signal Quality 


Time Beyond 


Ringback Supply 


Settling Time 


82491 Driving 


DO-63, DPO-7 1.7 min. : 
1.9 max. 


The shaded entries indicate specification violations with 
the example design layout. Intel is continuing to work 
on addressing these violations by either relaxing specifi- 
cations or improving the layout design. Note that no 
violations have been measured on the actual board. The 
violations have all been in simulation. 


7.7 Possible Modification to the 
Layout 


7.7.1 NON-PARITY LAYOUT 


Intel has not simulated a non-parity layout example. 
The following suggestions will assist in modifying the 
design example for non-parity implementations. You 
must simulate all paths that are altered when the parity 
components are removed to ensure that flight time and 
signal quality specifications are still met. 


Modify the following aspects of the layout example: 


1. Remove the two leftmost 82491 components, U9 and 
U10. These are the parity components. 


2. Rework Topologies 1 and 1b. Balance the array so 
that the two critical branch points branch out to 
electrically equivalent traces, i.e., adjust Lw to be 
electrically equivalent to Lv + Lz. Keep the trace 
leading to the Pentium processor length La. Topolo- 
gies INP and 1bNP illustrate this (NP = Non- 
Parity). Also, retune Lb to be electrically equivalent 
with these new trace lengths. Topologies 1 and 1b 
indicate exactly where the critical branch points are. 


3. Rework topologies 3 and 3b. Make the four traces 
branching from the 82496 electrically equivalent. 
This may be accomplished by making Lg = Lh for 
these topologies. 
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Shading indicates a spec. violation. 


4. Remove the Byte Enable traces that connect to the 
parity chips. 


Making traces electrically equivalent means that reflec- 
tions from all branches return to the source at the same 
point in time. In simple cases, electrically equivalent 
traces are the same length. In all cases, simulate the 
effects of changing trace lengths to find the proper trace 
length and routing. 


8.0 512K CPU-CACHE CHIP SET 


OPTIMIZED INTERFACE LAYOUT 
DESIGN EXAMPLE 


This chapter contains an example layout design for 
Intel’s 512 Kbyte CPU-Cache Chip Set’s optimized in- 
terface. Intel has simulated and verified the example 
layout using the latest information. Work is currently - 
underway to validate the design by measuring the flight 
time and signal quality parameters on boards based on 
the design example. As updated information becomes 
available on the components and the boards, Intel plans 
to update this example accordingly. 


The intent of the design example is to provide system 
designers a starting point. It provides one solution of 
how the Pentium processor, 82496, and 82491 compo- 
nents can be placed and routed to ensure flight time 
and signal quality specifications are met. It is not the 
only solution. System designers can alter the layout to 
meet their system requirements as long as the flight 
time and signal quality specifications are met. 
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8.1 Layout Objectives 


The 512K layout is an example of a CPU-Cache chip 
set arrangement that meets Intel’s chip set specifica- 
tions. The layout consists of 1 Pentium processor, 1 
82496 cache controller, and 18 82491 cache SRAMs for 
a 512K second-level cache with parity. Although the 
layout is specifically designed for a chip set with parity, 
we will also discuss conversion to a non-parity layout. 


This example layout targets the chip set’s flight time 
and signal quality specifications. In addition to meeting 
those specifications, we had the following objectives: 


1. To design the optimized portion of the interface so 
that the layout is not limited by interconnect per- 
formance. By not artificially creating any critical 
paths, the interface can yield maximum performance 
of the chip set. 


2. To be consistent with EMI and thermal require- 
ments. : 


oO 


°o 
82491 82491 


(Parity) 


° 
82491 


intel. 


3. To have the layout be used as a validation and cor- 
rection vehicle by Intel. Intel will use the layout to 
validate the optimized interface of the chip set, mea- 
sure flight times and signal quality, and tune input 
and output buffers. 


Provided are complete specifications for a board layout: 
part lists, board layer plots, and the electronic files in 
Gerber format. Also provided are a set of topologies 
and line lengths so it will be easy to understand how the 
layout was generated. | 


8.2 Component Placement 


To meet flight time with clock skew restrictions we 
placed the parts in relative proximity to each other. At 
the same time, we ensured that the layout’s Memory 
Bus Controller (MBC) interface signals are routable. 
Figure 59 illustrates how the chip set components are 
placed in the layout example. The dot indicates the lo- 
cation of pin 1. Figure 59 shows a component side view 
of the layout. 


Pentium™ 


82491 Processor 


1-3 cd40-43 cd36-39 Cd32-35 


82491. 82491 - 82491. 


Cd24-27 cd20-23 cd16-19 


° ° fe) 
82491 82491 82491 


cd56-59 cd52-55 cd48-51 


al ane 


°o =Pin 1 
241576-90 


Figure 59. Component Placement 
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~ 
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8.3 Signal Routing/Topologies 


Table 14 and Table 15 list the signal nets and their 
corresponding topologies for the optimized and exter- 
nal interfaces of the CPU-Cache Chip Set. 


All chip set signals in the optimized interface fall into 
six groups: low addresses, high addresses, Pentium 
processor control, 82496 control, CPU data, and byte 
enables. Within each group are subsets of signals that 
share common origination and destination points. Each 
subset has a unique routing called a “topology.” 
Groups, subsets, and topologies are listed in Table 14. 


Topologies are given only for signals that are routed to 
multiple chips. It is the system designer’s responsibility 
for routing the “point-to-point” signals such as 
CADS#. 


ma tae a 
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Topologies are also supplied for the external interface. 
These topolgies provide channels for routing signals 
from the chip set components to the periphery where 
they can be connected to the memory bus and memory 
bus controller (MBC). However, topologies are not 
supplied for point to point signals in the MBC interface 
(e.g. CRDY #). Instead, the system designer must opti- 
mize these for the particular application. 


Table 15 lists the topologies provided for the MBC in- 
terface signals which are not point to point. 


Figures 60 through 77 are the topologies which are de- 
scribed in Table 14 and 15. A topology is a graphical 
representation how specific sets of signals are routed. A 
topology shows the components that share a specific 
signal and the relative lengths of the traces between 
components. 


Table 14. Optimized Interface Signal Net/Topology Assignments 


(ADSC #, AP, CACHE #, D/C#, 
LOCK #, M/IO#, PCD, PWT, SCYC) 


(AHOLD, EADS #, KEN#, 
BRDYC1 #, INV, EWBE#, NA#, 
WB/WT#) 


(CDO0-CD63, CPO-CP7) 


(CBEO # -CBE7 #) 
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Low Addresses 


(BUS#, MAWEA#, WBWE#, Must be routed to optimize delay and signal quality at 

WBTYP#, WBA, BLAST #) the CS. 

(BLEC #) Not routed to parity CSs. 4 
(GOFF #) lis eed 


Same as high addresses. 


Keep within 3” of each other to minimize skew. 


High Addresses 


(PA18—PA31, PBTO-PBT3) Point to point links. Must be kept as short as 
possible. 


Pentium™ Processor Control 


(HITM#, W/R#) Same as low addresses. 1 
(ADS#) Ea ee are 


Same as high addresses. 


(BRDYC2#, WRARR#, MCYC#, Routed differently. 15 
WAY) 


CPU Data 
Point to point signals. Keep as short as possible. 


Byte Enables 


2 Aiea Ee aS 
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Table 15. External Interface Signal Net/Topology Assignments 


ne ee oe Topology | 
MDATAO-63, ParityO-7 


Ee ee ea Ges ne van 
ec) a ONO eee. eT ed ip 
eS ee eee Gee nea ee 


MFRZ#, MSEL#, MZBT#, MCLK 
RESETCPU 
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Routing Topology #1 


°o 


° ° 
82491 82491 82491 82491 


cd12-15 cd8-11 cd4-7 cd0o-3 


Pentium™ 


Processor 


1) fe) °o ° 
82491 82491 82491 82491 


82491 ~ 
(Parity) 


1-3 cd44-47 cd40-43 cd36-39 cd32-35 


° re) ° 
82491 82491 82491 
cd24-27 cd20-23 cd16-19 
82496 
° @) ° 
82491 82491 82491 
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Figure 60. Topology 1 
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Figure 61. Topology 2 
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Figure 62. Topology 3 
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Figure 65. Topology 6 
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Figure 66. Topology 7 
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Figure 67. Topology 8 
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Figure 68. Topology 9 
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Figure 69. Topology 10 
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Figure 70. Topology 11 
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Figure 71. Topology 12 
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Figure 72. Topology 13 
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Figure 73. Topology 14 
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Figure 74. Topology 15 
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Figure 75. Topology 16 
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Figure 76. Topology 17 
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Figure 77. Topology 18 


8.4 Board/Trace Properties out. These properties were used as the specification or 
guideline the board manufacturer was to use in building 
Specific board and trace properties were assumed while boards. Figure 78 provides the board layer stackup. 


performing the simulations to optimize the chip set lay- 
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Figure 78. Board Layer Stackup 


Table 16 lists the minimum and maximum trace char- 
acteristics. These parameters along with the board ma- 
terial determine the spacing between layers and the to- 
tal board thickness. See Table 17. 


Table 16. Trace Characteristics 


1.85 to 2.41 ns/ft 


Table 17. Other Printed Circuit 
Board Geometries 


[—paared [see 
a 
ear ee 


Only the inner layers of the board are impedance con- 
trolled. The top and bottom layers are not impedance 
controlled. 


, 
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8.5 Design Notes 


The following design notes accompany this layout ex- 
ample: 


1. The layout did not specifically address heat dissipa- 
tion except to allow space for heat sinks to be at- 
tached. Please see the Pentium™ Processor User’s 
Manual for the devices’ thermal specifications. The 
Pentium™ Processor Thermal Design Guidelines ap- 
plication note provides some examples of possible 
thermal solutions. 


2. All fast-switching signals are routed near the power 
and ground planes on inner layers of the board to 
minimize EMI effects. However, two sets of signals 
are routed on the top layer of the board: 
BRDYC1#, and JTAG signals. BRDYC1 # is rout- 
ed on top to take advantage of the higher trace veloc- 
ity there. JTAG signals are routed on the top layer 
because they are low-speed signals and will probably 
be re-routed by each customer to suit individual 
needs. 


3. Resistor RS (02) is used to set the Pentium proces- 
sor configurable output buffers (A3—-A20, ADS#, 
W/R#, and HITM#). When the resistor is included 
the buffers are set to the Extra Large size. When it is 
not included (BUSCHK # internally pulled high) the 
buffers are set to Large size. Intel currently recom- 
mends the x-large buffers be used for the 512K lay- 
out example. The O/N resistor should be designed into 
your design as Intel may change the recommended 
buffer size once silicon and the system design have 
been characterized. 
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4. The 82496 output buffers that drive the 82491 inputs 
must also be configured to be x-large. This is done by 
using a pulldown resistor on 82496 CLDRV[BGT #] 
(pin N04). 82496 and 82491 Memory Bus buffer 
sizes must be controlled by the Memory Bus Con- 
troller. 


5. Series termination resistors were added to the nets 
PA18, PA19, and PA20 to control overshoot. A val- 
ue of 24W is currently recommended, but that value 
may change when overshoot is measured on an actu- 
al board. 


8.6 Explanation of Information 
Provided 


The following sections outline the design files associat- 
ed with the 512 Kbyte CPU-Cache Chip Set design 
example that are available from Intel. These files are 
provided to simplify the task of porting the design ex- 
ample into a specific design. By using these files, de- 
signers may eliminate or minimize the amount of dupli- 
cate effort when using the design example as the basis 
for their design. The following items are provided: 


® Schematics 
© I/O Model Files 
@ Board Files 
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e Bill of Materials 

¢ Photoplot Log 

® Netlist Report 

e Placed Component Report 

e Artwork for Each Board Layer 
e Trace Segment Line Lengths 


Hard copies of the schematics and trace segment line 
lengths are provided in the following sections. ASCII 
or soft copies of all the information are available from 
Intel by requesting order number 241663, AP-48] De- 
sign Diskettes. 


8.6.1 SCHEMATICS 


Schematics for the 512 Kbyte CPU-Cache Chip Set de- 
sign example were created using ViewLogics’s Work- 
view V4.1. The schematics are 13 pages long. Both the 
Workview and the postscript files are available from 
Intel as described above. ; 
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8.6.2 I/O MODEL FILES 


All electrical I/O simulations were performed using 
TLC V4.1.13 from Quad Design Technology, Inc. The 
simulations were performed at the fast and slow corners 
to verify all signal quality and flight time specifications 
are met. The files used for these simulations are avail- 
able from Intel as described above. These files include 
the topology, model, and control files needed to run the 
simulations for all nets in the optimized interface. 


8.6.3 BOARD FILES 


The board files for the design example were created 
using Allegro V4.2 from Cadence Design Systems, Inc. 
The files are available from Intel as described above. 
These files may be used to import the design example 
into a specific system design. Note: some changes to the 
layout and nets may be necessary to complete import- 
ing these files into a specific system design. 


8.6.4 BILL OF MATERIALS 


The bill of materials file was created using Allegro V4.2 
from Cadence Design Systems, Inc. The file is available 
from Intel as described above. 


8.6.5 PHOTOPLOT LOG 
The photoplot log file was created using Allegro V4.2 


from Cadence Design Systems, Inc. The file is available 
from Intel as described above. 
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8.6.6 NETLIST REPORT 


The netlist report was created using Allegro V4.2 from 
Cadence Design Systems, Inc. The file is available from 
Intel as described above. 


8.6.7 PLACED COMPONENT REPORT 


The placed component report was created using 
Allegro V4.2 from Cadence Design Systems, Inc. The 
file is available from Intel as described above. 


8.6.8 ARTWORK FOR EACH BOARD LAYER 


The artwork for the six board layers were created using 
Allegro V4.2 from Cadence Design Systems, Inc. The 
files are available from Intel in a Gerber format as de- 
scribed above. 


8.6.9 TRACE SEGMENT LINE LENGTHS 


Sections 8.6.9.1 to 8.6.9.7 list the segment line lengths 
for each net of the optimized interface. All lengths are 
provide in mils (1/1000 inch). The stubs listed in the 
following tables are associated with the pin escapes re- 
quired for the 82491s. 


The following abbreviations have been used to repre- 
sent the different devices: 


PP = Pentium processor 


CC = 82496 cache controller 
CS = 82491 cache SRAM 
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8.6.9.5 Pentium™ Processor and 82496 Control, High Addresses, Pentium™ Processor Data 
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1.0 BACKGROUND 


As new generations and new models of processors have 
been added to the Intel X86 architecture (8086, 8088, 
Intel 286, Intel386™, Intel486™, and Pentium™ 
processors), Intel has provided increasingly sophisticat- 
ed methods to software for identifying the features 
available on the processor. 


e First, Intel started publishing code sequences that 
identify processor generation by detecting minor dif- 
ferences of implementation. 


e Later, with the advent of the Intel386 processor, In- 
tel started providing the CPU signature (family, 
model, and stepping numbers) to software at reset. 


e¢ Ultimately, Intel has extended the X86 architecture 
with the CPUID instruction. The CPUID instruc- 
tion not only provides the CPU signature but also 
provides information about the features supported 
by the processor. 


This latest step is necessary because the computing 
market is demanding processor models within a given 
processor generation that have differing sets of features. 
Anticipating that this trend will continue with future 
processor generations, Intel has made the CPUID in- 
struction extensible. 


The purpose of this Application Note is to show how to 
use the CPUID instruction in such a way that software 
can execute compatibly on the widest possible range of 
Intel X86 generations and models, past, present, and 
future. 


2.0 DETECTING THE CPUID 
INSTRUCTION 


Intel has provided a straightforward method for detect- 
ing whether the CPUID instruction is available. This 
method uses the ID flag in bit 21 of the EFLAGS regis- 
ter. If software can change the value of this flag, the 
CPUID instruction is available. The program example 
in Section 5.0 shows how to use the PUSHFD instruc- 
tion to read and the POPFD instruction to change the 
value of the ID flag. 


3.0 OUTPUTS OF THE CPUID 
INSTRUCTION 


Figure 1 summarizes the outputs of CPUID. 
The CPUID instruction can be executed multiple 


times, each time with a different parameter in the EAX 
register. The outputs depend on the value of EAX, as 
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intel. 
Table 1 specifies. To determine the highest acceptable 
value of the EAX parameter, the program should set 
EAX to zero. In this case, CPUID returns to EAX the 
value of the highest parameter that it recognizes. No 
execution of CPUID should use a parameter greater 
than this highest value. Although this highest value is 1 | 


for the first model of the Pentium processor, it might be 
different for future processor models. 


3.1 Vendor-ID String 


The vendor identification string is stored in the regis- 
ters EBX, EDX, and ECX in such a way that, when the 
contents of these registers are stored in memory in adja- 
cent locations (EBX at the lowest address, ECX at the 
highest), the byte string “Genuinelntel” appears in 
memory. 


While any imitator of the X86 architecture can provide 
the CPUID instruction, no imitator can legitimately 
claim that its part is a genuine Intel part. Therefore, the 
presence of the “‘Genuinelntel’”’ string is an assurance 
that the CPUID instruction and the CPU signature are 
implemented as described in this document. 


3.2 CPU Signature 


Beginning with the Intel386 processor, the CPU signa- 
ture has been available at reset. With processors that 
implement the CPUID instruction, the CPU signature 
is made available both by reset and by the CPUID in- 
struction. Figure 1 shows the format of the signature on 
the Intel486 processor, Pentium processor, and later 
processor generations; Table 2 shows the values that 
are currently defined. (The high-order 20 bits are unde- 
fined and reserved.) 


On Intel386 processors, the format of the CPU signa- 
ture is somewhat different, as Figure 2 shows. Table 3 
gives the currently possible values. 


Regardless of signature format, the MODEL and 
FAMILY fields are significant only in processors that 
do not implement the CPUID instruction. 


The STEPPING fields help software deal with errata. 


Intel releases information about errata and related step- 
ping numbers as needed. 
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OUTPUT IF EAX =0 
31 


HIGH VALUE INTEGER 


31 "23 
VENDOR ID 


OUTPUT IF EAX = 1 


CPU SIGNATURE 


STEPPING 


31 


FEATURE FLAGS BIT ARRAY (Refer to Table 4) 


; 241618-1 


Figure 1. CPUID Instruction Outputs 


Table 1. Effects of EAX Contents on CPUID 
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Figure 2. CPU Signature Format on Intel386™ Processors 


Table 3. Intel386™ CPU Signatures 


Major Minor Description 
Stepping | Stepping? P 
0011 0000 Intel886 DX Processor 


0010 | 0011 | 0000 | xx | Intel386 SX Processor 
0011 | 0011 | 0000 | xx | Intel 376 Processor 


Intel386 SL Processor 
0100 0011 0001 


Intel386 SL Processor (B-step) 
0000 0011 0100 RapidCAD™ Processor 


alntel releases information about minor stepping numbers as needed. 
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3.3 Feature Flags 


The CPUID instruction sets the feature register, EDX, 
to indicate which features the processor supports. Each 
set feature flag can indicate either that a feature is pres- 
ent or that a feature is not present. Refer to Table 4 for 
the meanings of the feature flags that are currently de- 
fined. For each future processor generation or model, 
refer to the programmer’s reference manual, user’s 
manual, or equivalent documentation for additional 
definitions. 


Software executing on a processor that supports the 
CPUID instruction should use the feature flags instead 
of the model and family to determine what features are 
present. Doing so helps make software immune to in- 
compatibilities that might arise from the release of an 
additional model that has a feature set different than 
that of the model for which the software was designed. 


Table 4. Feature Flag Values 


Abbrevia- 
Position| tion Meaning When Flag = 1 


Bit 
Pe FRB Floating-point unit on-chip 
(see note) 
Machine-check exception present 
Ce oe 


CMPXCHG8B instruction present 
9-318 | (see note) 


aSome non-essential information regarding the Pentium 
processor is considered Intel confidential and proprietary 
and has not been documented in this publication. This in- 
formation is provided in the Supplement to the Pentium™ 
Processor User’s Manual and is available with the appropri- 
ate non-disclosure agreements in place. Contact Intel Cor- 
poration for details. 


4.0 USAGE GUIDELINES 


This document presents straightforward -feature-detec- 
tion methods. Software should not try to identify fea- 
tures by exploiting programming tricks, “undocument- 
ed features,” or otherwise deviating from the intent of 
this document. The following list gives some tips that 
can help programmers maintain the widest range of 
compatibility for their software. 


® Do not depend on the absence of an invalid opcode 
trap on the CPUID opcode to detect CPUID. Do 
not depend on the absence of an invalid opcode trap 
on the PUSHFD opcode to detect a 32-bit proces- 
sor. Test the ID flag, as described in Section 2.0 and 
shown in Section 5.0 . 


¢ Do not assume that a given family or model has any 
specific feature. For example, do not assume that, 
because FAMILY=5 (-Pentium processor), there 
must be a floating-point unit on-chip. Use the fea- 
ture flags for this determination. 


PRELIMINARY 


© Do not assume that the existence of the CPUID in- 
struction implies a Pentium processor or later gener- 
ation CPU. Future versions of the Intel486 micro- 
processor may also include the CPUID instruction. 


¢ Do not use undocumented features of a CPU to 
identify steppings or features. For example, the In- 
tel386 CPU A-step had bit instructions that were 
withdrawn with B-step. Some software attempted to 
execute these instructions and depended on the in- 
valid-opcode exception as a signal that it was not 
running on the A-step part. This software failed to 
work correctly when the Intel486 CPU used the 
same opcodes for different instructions. That soft- 
ware should have used the stepping information in 
the CPU signature. 


© Do not assume that a value of 1 in a feature flag 
indicates that a given feature is present, even though 
that is the case in the first model of the Pentium 
processor. For some feature flags that might be de- 
fined in the future, a value of 1 can indicate that the 
corresponding feature is not present. 


e Programmers should be careful to test feature flags 
individually and not make assumptions about irrele- 
vant bits. It would be a mistake, for example, to test 
the FPU bit by comparing the feature register to 1 
with a compare instruction. 


® Do not assume that the clock of a given family or 
model runs at a specific speed. In particular, do not 
write clock-dependent code, such as timing loops. 
An upgrade processor can run at a faster speed, 
while reporting the same family and model. For an 
example, refer to the Intel487 and Intel486DX proc- 
essors in Table 2. These processors have different 
clock speeds, even though they have the same family 
and model numbers. Use the systems timers for 
measuring time. 


5.0 PROPER IDENTIFICATION 
SEQUENCE 


The following program example concludes this docu- 
ment by showing correct usage of the CPUID instruc- 
tion. It also shows how to identify earlier processor 
generations that implement neither the CPU signature 
nor the CPUID instruction. This program contains 
three procedures: 


1. get__cpuid which identifies the type of CPU. Figure 
3 shows the flow of this procedure. 


2. check__fpu which determines what type of floating- 
point unit (FPU) or math coprocessor (MCP) is 
present. 


3. print which uses DOS function calls to display the 
results of the other procedures on the monitor of a 
PC. This procedure can be omitted or modified to 
execute with other operating systems. 


This procedure has been tested with 8086, 80286, In- 
tel386, Intel486, and Pentium processors. 
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cpu_type=2 


cpu_type=3 
No 


cpu_type=4 


is the 
CPUID 
instruction 
supported 
? 


id_flag = 1; indicates CPUID 
instruction present. 
“Execute CPUID with input of 
0 to get vendor ID string and 
input values for E AX. 


If highest input value is at least 
1, execute CPUID with input of 
1 in EAX to obtain model, 
stepping, family, and features. 
Save in cpu_type, stepping, 
model, and feature_flags. 


Does the 
vendor ID = 
“Genuineintel” 
? 


end_get_cpuid 


Figure 3. Flow of GET.CPUID Procedure 
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Filename; cpuid32.msm 


This program has been developed by Intel Corporation. You have 
Intel's permission to incorporate this source code into your 
product royalty free. 


Intel specifically disclaims all warranties, express or implied, 
and all liability, including consequential and other indirect 
damages, for the use of this code, including liability for 
infringement of any proprietary rights. Intel does not assume 
any responsibility for any errors which may appear in this code 
nor any responsibility to update it. 


This program contains three parts: 
Part 1: Identifies CPU type in the variable cpu_type: 
0O=8086 processor 
2=Intel 286 processor 
S5=Intel386(TM) processor 
4=Intel486(TM) processor 
5=Pentium(TM) processor 


Identifies FPU type in the variable fpu_type: 
O=FPU not present 

1l=FPU present 

2=287 present (only if cpu_type=3) 

5=3587 present (only if cpu_type=3) 


Prints out the appropriate message. This part can 

be removed if this program is not used in a DOS=based 
System. Portions affected are at the end of the 

data segment and the print procedure in the code 
segment. 


This program was assembled with Microsoft's Assembler MASM 6.0. 
While this program mostly uses 16-bit operands, some 32-bit 
operands are required to check the 32-bit EFLAGS register once 
it has been determined that the processor is a least an 
Intel386 processor. 32-bit operations are invoked by using 

the macro OPND32. 
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TITLE CPUID 
DOSSEG 

emodel small 
-Stack 100h 
- 186 


; The OPND32 macro takes either zero or two parameters. 

; With zero parameters, it generates the 32-bit operand=-size prefix. 
With two parameters, it generates the 32-bit operand-size prefix, 
followed by an opcode and a 32-bit immediate value. These parameters 

; are used to generate XOR AX,imm32 instructions. 


Example 1. CPU Identification Procedure 
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OPND32 MACRO op_code, op_erand 


db 66h ; Force 32-bit operand size 
IFNB <op_code> 
db op_code ; Optional opcode 


IFNB <op_erand> 


dd op_erand ; Optional 32-bit immediate value 
ENDIF 
ENDIF 
ENDM 
CPUID MACRO ; 
db Ofh ; Opcode for CPUID instruction 
db Oa2h 
ENDM 
TRUE equ uA 
FAMILY_MASK equ Of00h 
FAMILY_SHIFT equ 8 
MODEL_MASK equ OfOh 
MODEL_SHIFT equ 4 
STEPPING_MASK equ Ofh 
FPU_FLAG equ lh 
MCE_FLAG equ 80h 
CMPXCHG8B_FLAG equ 100h 
-data 
fp_status dw ? 
vendor_id db 12 dup (?) 
cpu_type db ; 
model db ? 
stepping db ? 
id_flag db 0 
fpu_type db 0 
intel_proc db 0 
feature_flags dw 2 dup (0) 


; Remove the remaining data declarations if not using the DOS-based 


; print procedure 


id_msg db 
fp_8087 db 
fp_80287 db 
fp_80387 db 
c8086 db 
c286 db 
c386 db 
c486 db 
c486nfp db 
Intel486_msg db 


Pentium_msg 


"This system has a$" 

" and an 8087 math coprocessor$" 

"and an 80287 math coprocessor$" 

"and an 80387 math coprocessor$" 

"n 8086/8088 processor$" 

"n 80286 processor$" 

"n 80386 processor$" 

"n 80486 DX processor or 80487 SX math coprocessor$" 
"n 80486 SX processor$" 

13,10,"This system contains a Genuine " 
"Intel486(TM) processor",13,10,"$" 
13,10,"This system contains a Genuine " 
"Intel Pentium(TM) processor",13,10,"$" 
"Model: $" 

"Stepping: $" 


Example 1. CPU Identification Procedure (Continued) 


modelmsg db 
steppingmsg db 
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familymsg 
period 
dataCR 
intel_id 
fpu_msg 
mce_msg 


cmp_msg 


not_intel 


This code 


-code 


13,10,"Processor Family: $" 

*,* 15 5t0, "3" 

7,13,10,"3" 

"GenuineIntel" 

13,10,"This processor contains a FPU",13,10,"$" 
"This processor Supports the " 

"Machine Check Exception",13,10,"$" 

"This processor Supports the " 

"CMPXCHG8B instruction",13,10,"$" 

"t least an 80486 processor.",13,10 

"It does not contain a Genuine Intel part and as a " 
"result,",13,10,"the CPUID detection information " 
"cannot be determined at this time.",13,10,"$" 


identifies the processor and coprocessor 


that are currently in the system. The program first 
determines the processor id. When that is accomplished, 
the program then determines whether a coprocessor 
exists in the system. If a coprocessor or integrated 
coprocessor exists, the program identifies 

the coprocessor id. The program then prints out 

the CPU and floating point presence and type. 


mov ax, @data 


mov ads, ax 


set segment register 


; 
mov es, ax ; set segment register 
+] 


pushf 


; save for restoration at end 


call get_cpuid 
call get_fpuid 
call print 


popf 


mov ax, 4c0O0Oh ; terminate program 
int 2lh 


get_cpuid proc 


This procedure determines the type of CPU in a system 
and sets the cpu_type variable with the appropriate value. 
All registers are used by this procedure, none are preserved. 


Intel 8086 CPU check 
Bits 12-15 of the FLAGS register are always set on the 
8086 processor. 
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check_8086: 


pushf ; push original FLAGS 

pop ax ; get original FLAGS 

mov CX, ax ; save original FLAGS 

and ax, Offfh ; clear bits 12-15 in FLAGS 
push ax ; save new FLAGS value on stack 
popf ; replace current FLAGS value 
pushf ; get new FLAGS 

pop ax ; store new FLAGS in AX 

and ax, Of000h ; if bits 12-15 are set, then CPU 
cmp ax, Of000h . is an 8086/8088 

mov cpu_type, 0 ; turn on 8086/8088 flag 

je end_get_cpuid ; jump if CPU is 8086/8088 


Intel 286 CPU check 
Bits 12-15 of the FLAGS register are always clear on the 
Intel 286 processor in real-address mode. 


heck_80286 : 


; turn on 80286 flag 
; if no bits set, CPU is 80286 


mov Ccpu_type, 2 
jz end_get_cpuid 


or ex, OfO000h ; try to set bits 12-15 
push cx ; save new FLAGS value on stack 
popf ; replace current FLAGS value 
pushf ; get new FLAGS 
pop ax ; Store new FLAGS in AX 

: and ax, Of000h ; if bits 12-15 clear, CPU=80286 


Intel386 CPU check 

The AC bit, bit #18, is a new bit introduced in the EFLAGS 
register on the Intel486 DX CPU to generate alignment faults. 
This bit cannot be set on the Intel386 CPU. 


heck_80386: 

; It is now safe to use 32-bit opcode/operands ; 
mov bz,. SD ; save current stack pointer to align 
and Sp, not § ; align stack to avoid AC fault 
OPND32 
pushf ; push original EFLAGS 
OPND32 
pop ax ; get original EFLAGS 
OPND32 
mov cx, ax ; save original EFLAGS 
OPND32 35h, 40000h ; flip (XOR) AC bit in EFLAGS 
OPND32 
push ax ; save new EFLAGS value on stack 
OPND32 
popf ; replace current EFLAGS value 
OPND32 
pushf ; get new EFLAGS 
OPND32 
pop ax ; Store new EFLAGS in EAX 
OPND32 
xor ax, cx ; can't toggle AC bit, CPU=80386 
mov cpu_type, 3 ; turn on 80386 CPU flag 
mov So, b= ; restore original stack pointer 
jz end_get_cpuid ; jump if 80386 CPU 
and S68, not 3s ; align stack to avoid AC fault 
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OPND32 
push 
OPND3S2 
popf 
mov 


pes 
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; restore AC bit in EFLAGS first 
Sp, bx ; restore original stack pointer 


Intel486 DX CPU, Intel487 SX NDP, and Intel486 SX CPU check 
Checking for ability to set/clear ID flag (Bit 21) in EFLAGS 
which indicates the presence of a processor 


with the 
heck_80486 : 
mov 
OPND32 
mov 
OPND32 
OPND32 
push 
OPND32 
popf 
OPND32 
pushf 
OPND32 
pop 
OPND32 
xor 


je 


Execute 


’ 
’ 
; 
Cc 


heck_vendor: 
mov 
OPND32 
xor 
CPUID 
OPND32 
mov 
OPND32 
mov 
OPND32 
mov 
mov 
mov 
mov 

compare: 

repe 

or 

jnz 


intel_processor:;: 
mov 


ability to use the CPUID instruction. 


Cpu_type, 4 turn on 80486 CPU flag 


ax, Cx ; get original EFLAGS 
55h, 200000h flip (XOR) ID bit in EFLAGS 


ax save new EFLAGS value on stack 
; replace current EFLAGS value 
; get new EFLAGS 

ax ; Store new EFLAGS in EAX 


ax, ox >; can't toggle ID bit, 
end_get_cpuid : CPU=80486 


CPUID instruction to determine vendor, family, 


model and stepping. 


idfliac, ; set flag indicating use of CPUID inst. 


ax, ax ; set up input for CPUID instruction 
; macro for CPUID instruction 


word ptr vendor_id, bx ; setup to test for vendor id 
word ptr vendor_id[+4], dx 

word ptr vendor_id[+8], cx 

si, offset vendor_id 

di, 6ffset. intel. 2a 

cx, length intel_id 

cmpsb ; compare vendor id to "GenuinelIntel" 


Cx, Cx 
end_get_cpuid * if not zero, not an Intel CPU, 


intel_proc, l 
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cpuid_data: 
OPND32 
cmp ox, = make sure l is a valid input 
value for CPUID 
pak end_get_cpuid if not, jump to end 
OPND32 | 
xor ax, ax otherwise, use as input to CPUID 
OPND32 
inc ax and get stepping, model and family 
CPUID 
mov Stepping, al 
and Stepping, STEPPING_MASK ; isolate stepping info 
and al, MODEL_MASK ; isolate model info 
shr al, MODEL_SHIFT 
mov model, al 


and ax, FAMILY_MASK ; mask everything but family 
shr ax, FAMILY_SHIFT 
mov cpu_type, al ; set cpu_type with family 


OPND32 
mov feature_flags, dx save feature flag data 


end_get_cpuid: 
ret 
get_cpuid endp 


oRHRERAERAEKARKAERRAKHKARAAAKAEKAKAKEKKKAAKKKKLRKRAKKAKEKLHETARKS ESHER KE HES 


get_fpuid proc 

This procedure determines the type of FPU in a system 

and sets the fpu_type variable with the appropriate value. 
All registers are used by this procedure, none are preserved. 


we we we 


Coprocessor check 

The algorithm is to determine whether the floating-point 
Status and control words can be written to. If not, no 
coprocessor exists. If the status and control words can be 
written to, the correct coprocessor is then determined 
depending on the processor id. The Intel386 CPU can 

work with either an Intel287 NDP or an Intel387 NDP. 

The infinity of the coprocessor must be 

checked to determine the correct coprocessor id. 


we we we we we we we we we 


fninit ; reset FP status word 

mov fp_status, 5a5ah; initialize temp word to 
; non-zero value 

fnstsw fp_status ; Save FP status word 

mov ax, fp_status ; check FP status word 

cmp al, O ; see if correct status with 
; written 

mov fpu_type, 0 ; no fpu present 

jne end_get_fpuid 
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check_control_word: 


fnstcw 
mov 
and 


cmp 
mov 


jne 
mov 


; 80287/803587 


check_infinity: 
cmp 
jne 
fldl 
fldz 
fdiv 
vad Ue 
fchs 
fcompp 
fstsw 
mov 
mov 
sahf 
jz 
mov 

end_get_fpuid: 
ret 

get_fpuid endp 


fp_status 
ax, fp_status 
ax, 103fh 


ax, 3fh 
fpu_type, 0 


end_get_fpuid 
fpu_type, L 


check for the 


cpu_type, 3 
end_get_fpuid 


st 


fp_status 
ax, fp_status 
fpu_type, 2 


end_get_fpuid 
fpu_type,. 35 


we we we we we we 


AP-485 


save FP control word 

check FP control word 
see if selected parts 
looks OK 

check that l's & O's 

correctly read 


Intel386 CPU 


we ee we we we we we 


must use default control from FNINIT 
form infinity 

8087 and Intel287 NDP say +inf = -inf 
form negative infinity 

Intel387 NDP says +inf <> -inf 

see if they are the same and remove them 
look at status from FCOMPP 


; Store Intel287 NDP for fpu type 


see if infinities matched 
jump if 8087 or Intel287 is present 


; store Intel387 NDP for fpu type 


SPE ERLE REPRESS RASA LASSE SLES SS SESE SS PEARSE ST SESS VRE EASIER ESTER SSS CORE 


print proc 


This procedure prints the appropriate cpuid string and 


was Supported, this procedure prints out cpuid info. 


: numeric processor presence status. If the CPUID instruction 
9 


All registers are used by this procedure, none are preserved. 


cmp 


print_86: 
cmp 


idviae., - 1 


9 


print_cpuid_data 
ax, offset id_msg print initial message 


ah, 9h 
21h 


cpu_type, 0 


; if set. to 1, ¢pu supports 


CPUID instruction 


; print detailed CPUID information 


Example 1. CPU Identification Procedure (Continued) 
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jne 
mov 
mov 
int 
cmp 
je 

mov 
mov 
int 


jmp 


print_286: 


cmp 
jne 
mov 
mov 
int 
cmp 
je 

mov 
mov 
int 


jmp 


print_386: 


cmp 
jne 
mov 
mov 
int 
cmp 
je 

cmp 
jne 
mov 
mov 
int 
jmp 


print_587 ; 


mov 
mov 
int 


jmp 


print_486: 


cmp 
je 

mov 
mov 
int 


jmp 


mua. smrpatty a 


print_286 

dx, offset c8086 
ah, 9h 

21h 

fpu_type, 0 
end_print 

ax, offset fp_8087 
ah, 9h 

2lh 

end_print 


cpu_type, 2 
print_386 

dx, offset c286 
ah, 9h 

21h 

fpu_type, 0 
end_print 

dx, offset fp_80287 
ah, 9h 

2lh 

end_print 


cpu_type, 3 
print_486 

ax, offset c386 
ah, Qh 

21h 

fpu_type, 0 
end_print 
fpu_type, 2 
print_387 

dx, offset fp_80287 
ah, 9h 

21h 

end_print 


ax, offset fp_80387 
ah, 9h 

2lh 

end_print 


fpu_type, 0 
print_Intel1486sx 
ax, offset c486 
ah, 9h 

21h 

end_print 


print_Intel486sx: 
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mov 
mov 
int 
jmp 


ax, offset c486nfp 
ah, 9h 

2lh 

end_print 


Example 1. CPU Identification Procedure (Continued) 


oF EE eC ae cee ee 
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print_cpuid_data: 


cmp_vendor: 
cmp 
jne 


cmp 


jne 
mov 
mov 
int 
jmp 


check_Pentium: 
cmp 
jne 
mov 
mov 
int 


print_family: 
mov 
mov 
int 
mov 
mov 
add 
mov 
mov 
int 


print_model: 
mov 
mov 
int 
mov 
mov 
add 
mov 
mov 
int 


print_stepping: 
. mov 


mov 
int 
mov 
mov 
add 
mov 
mov 
int 


print_features: 
mov 
and 


"“20teL prea, 


not_GenuinelIntel 
Cpu_type, 4 


check_Pentium 

ax, offset Intel486_msg 
ah, 9h 

21h 

print_family 


Ccpu_type, 5 
print_features 

dx, offset Pentium_msg 
an,’ $n 


offset familymsg 
9h 


al, cpu_type 

byte ptr dataCR, al 
byte ptr dataCR, 30h 
ax, offset dataCR 
ah, 9h 

2lh 


dx, offset modelmsg 
ah, Qh 

2lh 

al, model 

byte ptr dataCR, al 
byte ptr dataCR, 30h 
ax, offset dataCR 
ah, 9h 

21h 


dx, offset steppingmsg 
ah, 9h 

2lh 

al, stepping 

byte ptr dataCR, al 
byte ptr dataCR, 30h 
dx, offset dataCR 

ah, 9h 

21h 


feature_flags 
FPU_FLAG 


; print family msg 


AP-485 


if cpu_type=4, print 


; Intel486 CPU message 


s 1f cpu_type=5, print 
; Pentium processor message 


; convert to ASCII 
; print family info 


; print model msg 


convert to ASCII 


; print model info 


; print stepping msg 


; convert to ASCII 
; print stepping info 


check for FPU 


Example 1. CPU Identification Procedure (Continued) 
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jz check_MCE 

mov offset fpu_msg 
mov 9h 

int 


check_MCE: 
mov feature_flags 
and ax, MCE_FLAG s check for MCE 
jz check_CMPXCHG8B 


mov ax, offset mce_msg 
mov ah, 9h 
int 2lh 


check_CMPXCHG8B: 
mov ax, feature_flags 
and ax, CMPXCHG8B_FLAG ; check for CMPXCHG8B 


jz end_print 

mov dx, offset cmp_msg 
mov ah, 9h 

int 21h 

jmp end_print 


not_GenuinelIntel: 
mov ax, offset not_Intel 
mov ah, 9h 
int 2lh 

end_print: 


ret 
print endp 


end 


Example 1. CPU Identification Procedure (Continued) 
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1.0 INTRODUCTION 


With the continuing demand for performance driving 
the need for high frequency microprocessors and infor- 
mation systems (i.e., the Pentium™ processor, i486 
microprocessors, and PCI Bus), designers of these com- 
plex systems are faced with the task of completing nu- 
merous system instructions within a shrinking window 
of time; from a 30 ns clock period @33 MHz to 15 ns 
@66 MHz. Many recent logic devices feature faster 
propagation delays, but this is not always enough. De- 


signers are now seeking relief from these timing con- 


straints by using higher quality clock driver devices, as 
well. At the current stage of advanced processor system 
development, clock signal timing accuracy is more im- 
portant than ever. Designers must be able to drive 
clocks to various regions of the board using special 
clock driver chips featuring extremely low output pin- 
to-output pin skew (Tgs). 


Intel programmable logic devices are a highly effective, 
low cost solution to the output skew minimization 
problem associated with high frequency clock distribu- 
tion. The 85C224 Programmable Logic Device is de- 
signed with extremely low output pin skew. In addition, 
it offers superior output signal quality including fast 
rise and fall times. Combined with the flexibility of pro- 
grammable logic the 85C224 can be programmed in a 
variety of clock driver configurations with maximum 
output pin skew of less than 400 ps. 


Unlike dedicated clock drivers that have limited flexi- 
bility, Intel’s 85C224 can be configured into different 
types of clock drivers. This flexibility allows the design- 
er to satisfy several clock driver needs with one device. 
In addition, while dedicated clock drivers can perform 
only one function (typically distribution or division), 
the flexible 85C224 satisfies programmable logic needs 
such as control signals and widespread glue logic needs. 
Only with a minimized output skew PLD like the 
85C224 can a designer implement clock distribution, 
division, and programmable logic with a single device. 


Not only is the 85C224 flexible, but it is also cost effec- 
tive. In comparison, dedicated clock driver devices with 
equivalent output skew range in cost of up to 10 times 
more than the 85C224! If the extrinsic value of the flex- 
ible programmable logic features were included in this 
cost comparison, the cost savings multiple would be 
even greater! 


For power intensive systems, like notebook PCs, low 
power consumption clearly becomes important. Fortu- 
nately, the 85C224 has always incorporated low power 
CMOS technology (typical _Ic¢c = 60 mA). The 
85C224 is an ideal clock driver for low power notebook 
_ PC’s and portable systems. a 
This Application Brief describes how the flexibility and 
extremely low output skews of the 85C224 meet the 
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clock driver needs of high clock speed systems. Two 
types of clock drivers are discussed, the simple gate 
driver or distributor and the frequency divider. 


2.0 CLOCK DISTRIBUTOR 


The 85C224 PLD performs as a simple gate driver or 
clock signal distributor when its outputs are pro- 
grammed to replicate an input clock. In this configura- 
tion, the input clock signal is replicated and driven 
from between | to 8 output pins (see Figure 1). 


From the outputs, the clock lines are then distributed 
to numerous regions of the system layout which require 
a synchronous version of the system clock as an input. 
Each of the output clocks has the same frequency and 
duty cycle as the original input clock. Output pin skew 
in this mode will typically be less than 250 ps for the 
low to high transition and 130 ps for the high to low 
transition (see Table 1). This configuration will drive/ 
distribute clocks up to a maximum clock frequency of 
133 MHz. A simple PLDasm program is shown below 
where the outputs (IO’s) follow the clock input (inp1): 


EQUATIONS 
‘2601 = tnpi 
iol. TRS =“Vece 
io2 = inpl 
162. TRST’ = V¥CC 


292117-1 


Figure 1. Gate Driver 


Design Recommendations: The lowest output skew can 
be obtained by defining Input 6 (pin 6) as the clock 
input of choice. This path has been determined by engi- 
neering to provide the lowest output pin skew. 


The best way to ensure a clean output clock with limit- 
ed ground bounce is to connect all unused input pins 
and optional ground pins to a clean ground plane. In 
addition, connect all optional power pins to a clean 
power supply. 
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The output skew that is noted in Table 1 was deter- 
mined with a configuration driving all 8 outputs and 
measuring worst case skew between all combinations of 
outputs. If less than eight output clocks are needed then 
the designer can attain even better output skew by se- 
lecting the optimal output pins which maintain progres- 
sively better output skew characterizations. For the low 
to high transition, the output pin skew decreases as 
used clock outputs are closer to Vcc, pin 28. In addi- 
tion, unused output pins should be tied to ground. For 
example, a 1 to 4 clock distribution using the first four 
I/Os (and grounding I/Os 5-8) provides output pin 
skew as low as 200 ps across all four outputs. 


Conversely, for the high to low transition the skew in- 
creases aS you move away from the ground pin. By 
grounding I/O1 and I/O8 the skew for the remaining 
pins are reduced to 100 ps. For systems requiring the 
least skew possible (< 100 ps), triggering from the fall- 
ing clock edge will provide the smallest clock skew 
achievable by the 85C224 (or any device on the mar- 
ket). 


3.0 FREQUENCY DIVIDER 


For systems that require multiple clock frequencies 
(i.e., 66 MHz and 33 MHz) or a clock signal with 50% 
duty cycle, a clock frequency divider can be used. By 
programming the outputs to toggle on the rising edge of 
the input clock, the 85C224 will divide the input fre- 
quency by two (see Figure 2). An example of a 
PLDasm frequency divider is shown below: 


Equations 
iol. sisi 
iol.TRST = VCC 
io? sa Jisl 
io2.TRST = VCC 


; Divide by 2 


292117-2 


Figure 2. Frequency Divider 
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In this mode the 85C224.can achieve typical output 
skews of 130 ps for the high to low transition and 
200 ps for the low to high transition (see Table 1). 
These superior output skews are achieved as a result of 
the close proximity of the toggling registers to the out- 
put pin. The reduced signal path in the divider configu- 
ration translates into a narrowing of the potential varia- 
tion in the output pin skew; hence, even less output pin 
skew. 


The maximum frequency of the 85C224-100 clock di- 
vider is 100 MHz input with a 50 MHz output. The 
85C224-10 will divide clock inputs up to 58 MHz and is 
offered at an even lower price. 


As with the clock distributor, the best way to insure a 
clean output clock signal and limited ground bounce is 
to connect all unused input pins, other than the clock 
input pin, (INPUT 1) to a clean ground plane. Also, 
connect the pins labeled “no connection” to ground 
and Pin 1 to Vcc. The skew increases for I/Os as they 
move farther from the Vcc pin. Conversely, for the 
high to low transition the skew increases for outputs 
farthest from the ground pin. This information is often 
useful when routing low skew priority signals. 


4.0 INVERTED OUTPUTS 


In addition to low output pin skews and a variety of 
clock driver configurations the 85C224 can be pro- 
grammed to invert any of the eight outputs. This fea- 
ture reduces skew and saves parts as it eliminates the . 
need to route a clock through an inverter chip to pro- 
vide the capability of triggering on the falling edge of 
the clock. The inversion is done with a negligible in- 
crease in output skew. From a software standpoint, the 
inversion can be accomplished by programming the 
“invert” bit in the PLD source code. In PLDasm, in- 
verting the signal is accomplished by using the active- 
low declaration “‘/” in front of the output signal that i 

to be inverted. 7 


5.0 RISE AND FALL TIMES 


The rise and fall times of the 85C224 meet the needs of 
high-speed system applications such as the PentiumTM 
microprocessor, while controlling ground bounce. Typ- 
ical rise times are under 1.4 ns and typical fall times are 
under 1000 ps. Rise and fall times were measured from 
0.8V to 2.0V with a 50 pF load. 
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6.0 TEMPERATURE DEPENDENCE 


As the temperature approaches 85°C, the output skew 
of the 85C224 remains under 500 ps. In combinatorial 
logic, the output skew will increase to typical values of 
350 ps for the low to high transition and 230 ps for the 
high to low transition. In registered mode the skew at 
85°C is typically 200 ps for the low to high transition 
and 250 ps for the high to low transition. 


7.0 POWER SAVINGS 


The 85C224 is perfect for laptop system as the 1-mi- 
cron CHMOS IIIE EPROM process allows it to con- 
sume less power than bipolar PALs and even CMOS 
GALS. Since there is an output enable P-term for every 
macro cell, a power management scheme can be pro- 
grammed into the array that will disable macrocells ac- 
cording to the power requirements of the system. In 
this case, the 85C224 can perform three functions: pow- 


er management, clock distribution and division, further 


demonstrating the high utility and flexibility of this de- 
vice. 


8.0 TOOLS AND SUPPORT 


Complete design files and JEDEC files associated with 
this brief can be downloaded via the PLDshell Plus 
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BBS. The number is (916) 985-2308. You must be a 
registered PLDshell Plus user to log in. 


PLDshell Plus software is available from Intel Litera- 
ture 1-800-548-4725 or your local Intel sales office. 


9.0 CONCLUSION 


The 85C224 is an excellent choice for low skew clock 
driver applications. The PLD offers flexibility which 
allows the designer to configure the device into an as- 
sortment of clock driving implementations: distribu- 
tion, division, and inversion. The device fundamentally 
provides low power and programmable logic function- 
ality to meet the needs of control logic, state machines, 
and power management—all with a single device, the 
85C224. The extremely low skews offered by the 
85C224 are as good and better than comparable single 
function clock drivers; for a fraction of the cost! 


High performance, low power, minimal output skew, 
programmable capabilities, low cost and overall flexi- 
bility make the 85C224 an intelligent solution for your 
design needs. 
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Figure 3. Measurements 
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Table 1. 85C224-10 Characteristic Data 


Ta = 25°C Ta = 75°C 
Parameter CL = SOpF | C. = 50pF 
Typ | Max | Typ | Max 
Comb tosHi- Combinatorial I/O to 1/0 Skew 5.0V | 130 175 | 
High to Low Transition 
Comb tos_H Combinatorial I/O to I/O Skew 5 OV 400 
Low to High Transition 
Reg tosHi Registered I/O to /0 Skew 50vV | 150 
High to Low Transition 
Reg tosi_H Registered I/O to |/O Skew 5 OV 
Low to High Transition 


CC ee Es 
/t; «| Fall Time (0.8V-2.0V) 1.0 eqeerars 
62.5 


fMAX Max. Counter Freq. 1/tcnt 
ee Internal Feedback 5.0V MHz 


Tim | 950204-100 ternal Feccback iViay [sav] [16] | 18 | wre 
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Tel: (358) 0 544 644 
FAX: (358) 0 544 030 


! 
FRANCE 


Intel Corporation S.A.R.L. 

1, Rue Edison-BP 303 

78054 St. Quentin-en-Yvelines 
Cedex 

Tel: (33) (1) 30 57 70 00 

FAX: (33) (1). 30 64 60 32 


intel. 


GERMANY 


Intel GmbH 

Dornacher Strasse 1 

85622 Feldkirchen/Muenchen 
Tel: (49) 089/90992-0 

FAX: (49) 089/9043948 


ISRAEL 
Intel Semiconductor Ltd. 


Atidim Industrial Park-Neve Sharet 


P.O. Box 43202 
Tel-Aviv 61430 

Tel: (972) 03 498080 
FAX: (972) 03 491870 


ITALY 


Intel Corporation Italia S.p.A. 
Milanofiori Palazzo E 

20094 Assago 

Milano 

Tel: (39) (2) 575441 

FAX: (39) (2) 3498464 


NETHERLANDS 


Intel Semiconductor B.V. 
Postbus 84130 

3009 CC Rotterdam 

Tel: (31) 10 407 11 11 
FAX: (31) 10 455 4688 


EUROPEAN SALES OFFICES 


RUSSIA 


Intel Technologies, Inc. 
Krementshugskaya 6/7 
121357 Moscow 

Tel: 007-095-4439785 

FAX: 007-095-4459420 
TLX: 612092 smail su. 


SPAIN 


Intel Iberia S.A. 
Zubaran, 28 

28010 Madrid 

Tel: (34) (1) 308 2552 
FAX: (34) (1) 410 7570 


SWEDEN 


Intel Sweden A.B. 
Dalvagen 24 

171 Soina 
Tel: (46) 8 705 5600 
FAX: (46) 8 278085 


UNITED KINGDOM 


Intel Corporation (U.K.) Ltd. 
Pipers Way 
Swindon, Wiltshire SN3 1RJ 
Tel: (44) (0793) 696000 
FAX: (44) (0793) 641440 


EUROPEAN DISTRIBUTORS/REPRESENTATIVES 


AUSTRIA 


t*Elbatex GmbH 
Eitnergasse 6 
A-1231 Wien 

Tel: (43) 1816020 
FAX: (43) 181652141 


tSpoerle Electronic 
Heiligenst. Str. 62 
A-1190 Wien 

Tel: (43) 1 318 72 700 
FAX: (43) 1 369 22 73 


BELGIUM 


t*Inelco Distribution 

Avenue des Croix de Guerre 94 
1120 Bruxelles 

Tel: (32) 2 244 2811 

FAX: (32) 2 216 3304 


*Diode Belgium 

2 ll, Minervastraat, 14/B2 
1930 Zaventem 

Tel: (32) 2 725 46 60 

FAX: (32) 2 725 45 11 


DENMARK 


*Avnet Nortec A/S 
Transformervej 17 
DK-2730 Herlev 

Tel: (45) 4284 2000 
FAX: (45) 4492 1552 


t*ITT Multikomponent AS 
Naverland 29 

DK-2600 Glostrup 

Tel: (45) 4245 6645 

FAX: (45) 4245 7624 


FINLAND 


t*OY Fintronic AB 
Pyyntitie, 3 

02230 Espoo 

Tel: (358) 0 887 331 
FAX: (358) 0 887 33 343 


FRANCE 


*Arrow Electronique 
73-79 Rue des Solets 
Silic 585 

94663 Rungis Cedex 
Tel: (33) (1) 4978 4978 
FAX: (33) (1) 4978 0596 


*Avnet 

79, rue Pierre Semard 
92322 Chatillon 

Tel: (33) (1) 4965 2500 
FAX: (33) (1) 4965 2769 


tMetrologie 

Tour d’Asnieres 

4, Avenue Laurent Cely 
92606 Asnieres Cedex 

Tel: (33) (1) 4080 9000 
FAX: (33) (1) 4791 0561 


*Tekelec 

Cite des Bruyeres 

5, Rue Carle Vernet-BP 2 
92310 Sevres 

Tel: (33) (1) 4623 2425 
FAX: (33) (1) 4507 2191 


*Components 
tSystems 


GERMANY 


*Avnet Electronic 2000 
Stahigruberring 12 
81829 Muenchen 

Tel: (49) 89 45110-01 
FAX: (49) 89 45110129 


*Jermyn GmbH 

Im Dachsstueck 9 
65549 Limburg 

Tel: (49) 6431 5080 
FAX: (49) 6431 508289 


tMetrologie GmbH 
Steinerstrasse 15 
81369 Muenchen 

Tel: (49) 89 724470 
FAX: (49) 89 72447111 


*Proelectron Vertriebs GmbH 
Max-Planck-Strasse 1-3 
63303 Dreieich 

Tel: (49) 6103 304343 

FAX: (49) 6103 304425 


tRein Elektronik GmbH 
Loetscher Weg 66 
41303 Nettetal 

Tel: (49) 2153 7330 
FAX: (49) 2153 733513 


GREECE 


tErgodata 
Aigiroupoleos 2A 

176 76 Kalithea 

Tel: (30) 1 95 10 922 
FAX: (30) 1 95 93 160 


*Pouliadis Associates Corp. 
Aristotelous St. 3/Sygrou Av. 150 
Athens 17671 

Tel: (30) 1 924 2072 

FAX: (30) 1 924 1066 


IRELAND 


t*Micro Marketing 
Taney Hall 

Eglinton Terrace 
Dundrum 

Dublin 14 

Tel: (353) (1) 298 9400 
FAX: (353) (1) 298 9828 


ISRAEL 


t*Eastronics Limited 
Rozanis 11 

P.O.B. 39300 

Tel Baruch 

Tel-Aviv 61392 

Tel: (972) 3 6458 777 
FAX: (972) 3 6458 666 


ITALY 


*Intesi Div. Della Deutsche 
Divisione ITT Industries GmbH 
P.|. 06550110156 

Milanofiori Palazzo e5 

20094 Assago (Milano) 

Tel: (39) 2 824701 

FAX: (39) 2 8242631 


*Lasi Elettronica 

P.1. 00839000155 

Viale Fulvio Testi, N.280 
20126 Milano 

Tel: (39) 2 661431 

FAX: (39) 2 66101385 


tOmnilogic Telcom 
Via Lorenteggio 270/A 
20152 Milano 

Tel: (39) 2 48302640 
FAX: (39) 2 43802010 


NETHERLANDS 


tDatelcom B.V. 
Meidoornkade 22 
3993 AE Houten 

Tel: (31) 3403 57222 
FAX: (31) 3403 57220 


*Diode Components 
Coltbaan 17 

3439 NG Nieuwegein 
Tel: (31) 3402 9 12 34 
FAX: (31) 3402 3 59 24 


t*Koning en Hartman 
Energieweg 1 

2627 AP Delft 

Tel: (31) 15 609 906 
FAX: (31) 15 619 194 


NORWAY 


*Avnet Nortec A/S 
Postboks 123 
N-1364 Hvalstad 
Tel: (47) 284 6210 
FAX: (47) 284 6545 


tComputer System Integration A/S 
Postbox 198 

N-2013 Skjetten 

Tel: (47) 638 45 411 

FAX: (47) 638 45 310 


PORTUGAL 


*ATD Electronica LDA 
Edificio Altejo 

Rua 3 piso 5-sala 505 
Urbanizacao de Matinha 
1900 Lisboa 

Tel: (351) (1) 858 0191 /2 
FAX: (351) (1) 858 7841 


tMetrologia Iberica Portugal 

Rua Dr. Faria de Vasconcelos 3A 
1900 Lisboa 

Tel: (351) (1) 847 2202 

FAX: (351) (1) 847 2197 


SOUTH AFRICA 


T*EBE 

PO Box 912-1222 
Silverton 0127 

178 Erasmus Street 
Meyerspark 

Pretoria 0184 

Tel: (27) 12 803 7680-93 
FAX: (27) 12 803 8294 


SPAIN 


*ATD Electronica 

Avenue de la Industria, 32, 2B 
28100 Alcobendas 

Madrid 

Tel: (34) (1) 661 6551 

FAX: (34) (1) 661 6300 


tMetrologia Iberica 
Avda. Industria, 32-2 
28100 Alcobendas 
Madrid 

Tel: (34) (1) 661 1142 
FAX: (34) (1) 661 5755 


SWEDEN 


tAvnet Computer AB 
Box 184 

S-123 23 Farsta 

Tel: (46) 8 705 18 00 
FAX: (46) 8 735 2373 


*Avnet Nortec AB 
Box 1830 

S-171 27 Solna 
Tel: (46) 8705 1800 
FAX: (46) 883 6918 


*ITT Multikomponent AB 
Ankdammsgatan 32 
Box 1330 

S-171 26 Solna 

Tel: (46) 8 830020 

FAX: (46) 8 27 13 03 


SWITZERLAND 


tElbatex AG 

Hardstr. 7 

CH-5430 Wettingen 
Tel: (41) 56 27 50 00 
FAX: (41) 27 19 24 


tFabrimex AG 
Kirchenweg 5 
CH-8032 Zurich 

Tel: (41) 1 386 86 86 
FAX: (41) 1 383 23 79 


tiMIC Microcomputer 
Zurichstrasse 
CH-8185 Winkel-Ruti 
Tel: (41) (1) 8620055 
FAX: (41) (1) 8620266 


t*Industrade AG 
Hertistrasse 31 
CH-8304 Wallisellen 
Tel: (41) (1) 8328111 
FAX: (41) (1) 8307550 


TURKEY 


*Empa Electronic 
Florya Is Merkezi 
Besyol Londra Asfalti 
34630 Florya Istanbul 
Tel: (90) (1) 599 3050 
FAX: (90) (1) 599 3061 


UNITED KINGDOM 


*Arrow Electronics 

St. Martins Business Centre 
Cambridge Road 

Bedford - MK42 OLF 

Tel: (44) 234 270272 

FAX: (44) 234 211434 


*Avnet EMG Ltd. 

Jubilee House 

Jubilee Road 

Letchworth 

Hertsfordshire - SG6 1QH 
Tel: (44) 462 488 500 
FAX: (44) 462 488 567 


*Bytech Components 
12a Cedarwood 
Chineham Business Park 
4 Crockford Lane 
Basingstoke 

Hants RG12 1RW 

Tel: (44) 256 707 107 
FAX: (44) 256 707 162 


tBytech Systems 

5 The Sterling Centre 
Eastern Roa 

Bracknell 

Berks - RG12 2PW 
Tel: (44) 344 55 333 
FAX: (44) 344 867 270 


*Datrontech 

42-44 Birchett Road 
Aldershot 

Hants —-GU11 1LU 
Tel: (44) 252 313155 
FAX: (44) 252 341939 


*Jermyn Electronics 
Vestry Estate 

Otford Road 
Sevenoaks 

Kent TN14 5EU 

Tel: (44) 732 743 743 
FAX: (44) 732 451 251 


tMetrologie VA 

Rapid House 

Oxford Road 

High Wycombe 

Bucks - HP11 2E 

Tel: (44) 494 526 271 
FAX: (44) 494 421 860 


*MMD/Rapid Ltd. 
Rapid Silicon 

3 Bennet Court 
Bennet Road 

Reading 

Berks - RG2 0QX 

Tel: (44) 734 750 697 
FAX: (44) 734 313 255 
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Microprocessors: Volume III 


In 1993, Intel introduced the Pentium’ Processor, the fifth- 
generation processor based on the Intel Architecture. The Pentium 


“Ss 


processor represents the next generation of power for high-end 
desktop and server performance. The Pentium processor is fully 
code compatible with previous members of the Intel Architecture 
family of microprocessors, thus preserving the value of user’s 
software investments. Up to five times as powerful as the 33-MHz 
Intel486° DX CPU, the Pentium processor extends the Intel 
processor performance continuum while maintaining full 
compatibility with existing software. 


The Pentium processor employs the most advanced technology and 
engineering innovation and is the enabling technology for today’s 
high-end and tomorrow’s emerging applications. The Pentium 
processor incorporates a superscalar architecture, improved floating 
point unit, separate on-chip code and data caches, 64-bit external 
data bus, and other features designed to provide an architectural 
platform for high-performance computing. 


This handbook contains information regarding the design of 
Pentium processor-based systems, including secondary cache 
subsystems, local bus systems based on the PCI specification, and 
Advanced Programmable Interrupt Controller (APIC) designs. 


ISBN 1-55512e-194-5 
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